(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen dee brevets 



(12) 



(11) EP 0 321 362 B1 

EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and nnention 
of the grant of the patent: 
25.09.1996 Bulletin 1996/39 

(21) Application number: 88403229.3 

(22) Date of filing: 16.12.1988 



(51) intci.6: C12N 15/12, C12P 21/02, 
01 2Q 1/68, C07K 14/00 



(54) Retinoic acid receptor and derivatives thereof, DNA encoding either substance and use of the 
proteins and of the DNA 

Retinoesaurerezeptor und Derivate davon, beide Substanzen codierende DNS und die Verwendung 
der Proteine und der DNS 

Recepteur de I'acide retinoique et ses derivees, DNA codant pour ces deux substances et I'usage 
des proteines et des DNA 



(84) Designated Contracting States: 

AT BE OH DE ES FR GB GR IT LI LU NL SE 

(30) Priority: 16.12.1987 US 133687 

17.12.1987 US 134130 

20.06.1988 US 209009 
30.11.1988 US 278136 

(43) Date of publication of application: 
21.06.1989 Bulletin 1989/25 

(73) Proprietor: INSTITUT PASTEUR 
F-75715 Paris Cedex 15 (FR) 

(72) Inventors: 

• Tiollals, Pierre 
F-75013 Parls (FR) 

• Dejean, Anne 
F-75014 Paris (FR) 

• Blaudin de The, Hugues 
F-75003 Paris (FR) 

• Marchio, Agnes 
F-75011 Paris (FR) 

(74) Representative: Gutmann, Ernest et al 
Ernest Gutmann - Yves Plasseraud S.A. 
3, rue Chauveau-Lagarde 

75008 Paris (FR) 



m 

(O 
CO 

CO 

o 

Q. 
lU 



(56) References cited: 
EP-A-0 244 221 

• NATURE, vol. 333, 16th June 1988, pages 
669-672; D. BENBROOK et al.: "A new retinoic 
acid receptor Identified from a hepatocellular 
carcinoma" 

• NATURE, vol. 332, 28th April 1988, pages 
850-853; N. BRAND et al.: "Identification of a 
second human retinoic acid receptor" 

• NATURE, vol. 322, 3rd July 1986. pages 70-72; A. 
DEJEAN et al.: "Hepatitis B virus DNA 
Intergration In a sequence homologous to 
v-erb-A and steriod receptor genes in a 
hepatocellular carcinoma" 

• NATURE, vol. 330, 3rd December 1987, pages 
444-450; M. PETKOVICH etal.: "A human retinoic 
acid receptor which belongs to the family of 
nuclear receptors" 

• SCIENCE, vol. 240, 13th May 1988, pages 
889-895; R.M. EVANS: "The steroid and thyroid 
hormone receptor superfamily" 

• NATURE, vol. 330, 17th December 1987, pp. 
667-670; H. DE THE. 



Note: Within nine nnonths from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in 
a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art, 
99(1) European Patent Convention). 



Printed by Jouve, 75001 PARIS (FR) 



EP 0 321 362 B1 

Description 

BACKGROUND OF THE INVENTION 

5 This invention relates to nucleotide sequences, polypeptides encoded by the nucleotide sequences, and to their 

use in diagnostic and pharmaceutical applications. 

Primary hepatocellular carcinoma (HCC) represents the most common cancer, especially in young men, in many 
parts of the world (as in China and in much of Asia and Africa) (reviewed in Tiollais et al., 1985). Its etiology was 
investigated mostly by epidemiological studies, which revealed that, beyond some minor potential agents such as 
10 aflatoxin and sex steroids, hormones, Hepatitis B virus (HBV) chronic infection could account for a large fraction of 
liver cancers (Beasley and Hwang, 1984). 

HBV DNA has been found to be integrated in the genome of most cases of HCCs studied (Edman et al., 1980; 
Brechot et al., 1980; Chakraborty et al.. 1980; Chen et al., 1982). Nonetheless the role of those sequences in liver 
oncogenesis remains unclear. 

15 A single HBV integration in a HCC sample in a short liver cell sequence has been reported recently. The sequence 

was found to be homologous to steroid receptor genes and to the cellular proto-oncogene c-erbA (Dejean et al., 1 986). 

Ligand-dependent transcriptional activators, such as steroid or thyroid hormone receptors, have recently been 
cloned allowing rapid progress in the understanding of their mechanism of action. Nevertheless, there exists a need 
in the art for the identification of transcripts that may encode for activational elements, such as nuclear surface recep- 

20 tors, that may play a role in hepatocellular carcinoma. Such findings would aid in identifying corresponding transcripts 
in susceptible individuals. In addition, identification of transcripts could aid in elucidating the mechanisms by which 
HCC occurs. 

Retinoids, a class of compounds including retinol (vitamin A), retinoic acid (RA), and a series of natural and syn- 
thetic derivatives, exhibit striking effects on cell proliferation, differentiation, and pattern formation during development 

25 (Strickland and Mahdavi, 1978; Breitman et al., 1980; Roberts and Spom, 1984; Thaller and Eichele. 1987). Until 
recently, the molecular mechanism by which these compounds exert such potent effects was unknown, although retin- 
oids were thought to modify their target cells through a specific receptor. 

Except for the role of retinoids in vision, their mechanism of action is not well understood at the molecular level. 
Several possible mechanisms have been suggested. One hypothesis proposes that retinoids are needed to serve as 

30 the lipid portion of glycolipid intermediates involved in certain, specific giycosylation reactions. Another mechanism, 
which may account for the various effects of retinoids on target cells, is that they alter genomic expression in such 
cells. It has been suggested that retinoids may act in a manner analogous to that of the steroid hormones and that the 
intracellular binding proteins (cellular retinal-binding and retinoic acid-binding protein) play a critical part in facilitating 
the interaction of retinoids with binding sites in the cell nucleus. 

35 For example, the obsen/ation that the RA-induced differentiation of murine F9 embryonal carcinoma cells is ac- 

companied by the activation of specific genes has led to the proposal that RA, like the steroid and thyroid hormones, 
could exert its transcriptional control by binding to a nuclear receptor (Roberts and Spron, 1984). However, the bio- 
chemical characterization of this receptor had been hampered by high affinity RA-binding sites corresponding to the 
cellular retinoic acid binding protein (CRABP), which is thought to be a cytoplasmic shuttle for RA (Chytil and Ong, 

40 1 984). 

In any event, retinoids are currently of interest in dermatology. The search for new retinoids has identified a number 
of compounds with a greatly increased therapeutic index as compared with naturally occurring retinoids. Extensive 
clinical testing of two of these retinoids, 1 3-cis- retinoic acid and the aromatic analog etretinate, has lead to their clinical 
use in dermatology. In addition, several lines of evidence suggest that important relations exist between retinoids and 

45 cancer. A , umber of major diseases, in addition to cancer, are characterized by excessive proliferation of cells, often 
with excessive accumulation of extracellular matrix material. These diseases include rheumatoid arthritis, psoriasis, 
idiopathic pulmonary fibrosis, sclerodemna, and cirrhosis of the liver, as well as the disease process atherosclerosis. 
The possibility exists that retinoids, which can influence cell differentiation and proliferation, may be of therapeutic 
value in some of the proliferative diseases. There exists a need in the art for reagents and methods for carrying out 

50 studies of receptor expression and effector function to determine whether candidate drugs are agonists or antagonists 
of retinoid activity in biological systems. 

There also exists a need in the art for identification of retinoic acid receptors and for sources of retinoic acid 
receptors in highly purified form. The availability of the purified receptor would make it possible to assay fluids for 
agonists and antagonists of the receptor. 

55 

SUMMARY OF THE INVENTION 

This invention aids in fulfilling these needs in the art. More particularly this invention provides a cloned DNA 
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sequence encoding for a polypeptide of a newly identified cellular gene, which has been named hap. The DN A sequence 
has the formula shown in figures 2a, 2b, 2c, 2d successively (and collectively designated as figure 2). More particularly, 
the sequence comprises a sequence encoding for a polypeptide named hap or a fragment thereof having the following 
formula : 

ATG TTT GAC TGT ATG GAT GTT CTG TCA GTG AGT CCT GGG CAA 
ATC CTG GAT TTC TAG ACT GCG AGT CCG TCT TCC TGC ATG CTC 
CAG GAG AAA GCT CTC AAA GCA TGC TTC AGT GGA TTG ACC CAA 
ACC GAA TGG CAG CAT CGG CAC ACT GCT CAA TCA ATT GAA ACA 
CAG AGC ACC AGC TCT GAG GAA CTC GTC CCA AGC CCC CCA TCT 
CCA CTT CCT CCC CCT CGA GTG TAC AAA CCC TGC TTC GTC TGC 
CAG GAC AAA TCA TCA GGG TAC CAC TAT GGG GTC AGC GCC TGT 
GAG GGA TGT AAG GGC TTT TTC CGC AGA AGT ATT CAG AAG AAT 
ATG ATT TAC ACT TGT CAC CGA GAT AAG AAC TGT GTT ATT AAT 
AAA GTC ACC AGG AAT CGA TGC CAA TAC TGT CGA CTC CAG AAG 
TGC TTT GAA GTG GGA ATG TCC AAA GAA TCT GTC AGG AAT GAC 
AGG AAC AAG AAA AAG AAG GAG ACT TCG AAG CAA GAA TGC ACA 
GAG AGC TAT GAA ATG ACA GCT GAG TTG GAC GAT CTC ACA GAG 
AAG ATC CGA AAA GCT CAC CAG GAA ACT TTC CCT TCA CTC TGC 
CAG CTG GGT AAA TAC ACC ACG AAT TCC AGT GCT GAC CAT CGA 
GTC CGA CTG GAC CTG GGC CTC TGG GAC AAA TTC AGT GAA CTG 
GCC ACC AAG TGC ATT ATT AAG ATC GTG GAG TTT GCT AAA CGT 
CTG CCT GGT TTC ACT GGC TTG ACC ATC GCA GAC CAA ATT ACC 
CTG CTG AAG GCC GCC TGC CTG GAC ATC CTG ATT CTT AGA ATT 
TGC ACC AGG TAT ACC CCA GAA CAA GAC ACC ATG ACT TTC TCA 
GAC GGC CTT ACC CTA AAT CGA ACT CAG ATG CAC AAT GCT GGA 
TTT GGT CCT CTG ACT GAC CTT GTG TTC ACC TTT GCC AAC CAG 
CTC CTG CCT TTG GAA ATG GAT GAC ACA GAA ACA GGC CTT CTC 
AGT GCC ATC TGC TTA ATC TGT GGA GAC CGC CAG GAC CTT GAG 
GAA CCG ACA AAA GTA GAT AAG CTA CAA GAA CCA TTG CTG GAA 
GCA CTA AAA ATT TAT ATC AGA AAA AGA CGA CCC AGC AAG CCT 
CAC ATG TTT CCA AAG ATC TTA ATG AAA ATC ACA GAT CTC CGT 
AGC ATC AGT GCT AAA GGT GCA GAG CGT GTA ATT ACC TTG AAA 
ATG GAA ATT CCT GGA TCA ATG CCA CCT CTC ATT CAA GAA ATG 
ATG GAG AAT TCT GAA GGA CAT GAA CCC TTG ACC CCA AGT TCA 
AGT GGG AAC ACA GCA GAG CAC AGT CCT AGC ATC TCA CCC AGC 
TCA GTG GAA AAC AGT GGG GTC AGT CAG TCA CCA CTC GTG CAA 
TAA, 
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The invention also covers variants and (ragments of the DNA sequence. The DNA sequence is in a purified form. 

This invention also provides a probe consisting of a radionucieotide bonded to the DNA sequence of the invention. 

In addition, this invention provides a hybrid duplex molecule consisting essentially of the DNA sequence of the 
invention hydrogen bonded to a nucleotide sequence of complementary base sequence, such as DNA or RNA. 

Further, this invention provides a polypeptide comprising an amino acid sequence of hap protein, wherein the 
polypeptide contains the amino acid sequence shown in Figure 2. More particularly, the amino acid sequence consists 
of the following sequence : 

Met Phe Asp Cys Met Asp Val Leu Ser Val Ser Pro Gly Gin 
lie Leu Asp Phe Tyr Thr Ala Ser Pro Ser Ser Cys Met Leu 
Gin Glu Lys Ala Leu Lys Ala Cys Phe Ser Gly Leu Thr Gin 
Thr Glu Trp Gin His Arg His Thr Ala Gin Ser lie Glu Thr 
Gin Ser Thr Ser Ser Glu Glu Leu Val Pro Ser Pro Pro Ser 
Pro Leu Pro Pro Pro Arg Val Tyr Lys Pro Cys Phe Val Cys 
Gin Asp Lys Ser Ser Gly Tyr His Tyr Gly Val Ser Ala Cys 
Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser lie Gin Lys Asn 
Met He Tyr Thr Cys His Arg Asp Lys Asn Cys Val He Asn 
Lys Val Thr Arg Asn Arg Cys Gin Tyr Cys Arg Leu Gin Lys 
Cys Phe Glu Val Gly Met Ser Lys Glu Ser Val Arg Asn Asp 
Arg Asn Lys Lys Lys Lys Glu Thr Ser Lys Gin Glu Cys Thr 
Glu Ser Tyr Glu Met Thr Ala Glu Leu Asp Asp Leu Thr Glu 
Lys He Arg Lys Ala His Gin Glu Thr Phe Pro Ser Leu Cys 
Gin Leu Gly Lys Tyr Thr Thr Asn Ser Ser Ala Asp His Arg 
Val Arg Leu Asp Leu Gly Leu Trp Asp Lys Phe Ser Glu Leu 
Ala Thr Lys Cys He He Lys He Val Glu Phe Ala Lys Arg 
Leu Pro Gly Phe Thr Gly Leu Thr He Ala Asp Gin He Thr 
Leu Leu Lys Ala Ala Cys Leu Asp He Leu He Lou Arg He 
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The invention also covers serotypic variants of the polypeptide and fragments of the polypeptide. The polypeptide is 
free from human serum proteins, virus, viral proteins, human tissue, and human tissue components. Preferably the 

25 polypeptide Is free from human, blood-derived protein. 

The hag protein (hap for hepatoma) exhibits strong homology with the human retlnoic acid receptor (RAR) de The, 
H., Marchio, A., Tiollais, P. & Dejean, A. Nature 330, 667-670 (1987). Petkovich, M.. Brand, N.J.. Krust, A. & Chambon, 
P. Nature 330. 444-450 (1987), GIguere, V., Ong. E.S., Segui, P. & Evans, R.M. Nature 330, 624-629 (1987). To test 
the possibility that the hap protein might also be a retinoid receptor, a chimaeric receptor'was created by replacing the 

30 putative DNA binding domain of hap with that of the human oestrogen receptor (ER). The resulting hap -ER chimaera 
was then tested for its ability to trans-activate an oestrogen -responsive reporter gene (vit-tk-CAT) in the presence of 
possible receptor ligands. It was discovered that retlnoic acid (RA) at physiological concentrations is effective in In- 
ducing the expression of this reporter gene by the hap-ER chimaeric receptor. See Nature, 332:850-853 (1988). This 
demonstrates the existence of two human retlnoic acid receptors designated RAR-a and RAR-p. 

35 More particularly. It has been discovered that the hap protein Is a second retinolc acid receptor. Thus, the expression 

"hap protein" is used interchangeably herein with the abbreviation "RAR-p" for the second human retlnoic acid receptor 
Also, this invention provides a process for selecting a nucleotide sequence coding for hap protein or a portion 
thereof from a group of nucleotide sequences comprising the step of determining which of the nucleotide sequences 
hybridizes to a DNA sequence of the invention. The nucleotide sequence can be a DNA sequence or an RNA sequence. 

40 The process can include the step of detecting a label on the nucleotide sequence. 

Still further, this invention provides a recombinant vector comprising lambda-NH1149 having an EcoRI restriction 
endonuclease site into which has been inserted the DNA sequence of the invention. The invention also provides plasmid 
pCOD20, which comprises the DNA sequence of the invention. 

This invention provides an E. coli bacterial culture in a purified form, wherein the culture comprises E. coll cells 

45 containinq DNA, wherein a portion of the DNA comprises the DNA sequence of the invention. Preferably the E. coll 
is stain TG-1 . 

In addition, this invention provides a method of using the purified retinolc acid receptor of the invention for assaying 
a medium, such as a fluid, for the presence of an agonist or antagonist of the receptor. In general, the method comprises 
providing a known concentration of a proteiaceous receptor of the invention, incubating the receptor with a llgand of 

50 the receptor and a suspected agonist or antagonist under conditions sufficient to form a ligand-receptor complex, and 
assaying for llgand-receptor complex or for free ligand or for non-complex receptor. The assay can be conveniently 
carried out using labelled reagents as more fully described hereinafter, and conventional techniques based on nucleic 
acid hybridization, immunochemistry, and chromotograph, such as TLC. HPLC, and affinity chromotography. 

In another method of the invention, a medium Is assayed for stimulation of trasncription of the RAR-p gene or 

55 translation of the gene by an agonist or antagonist. For example, p-receptor binding retinoids can be screened in this 
manner 
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BRIEF DESCRIPTION OF THE DRAWINGS 

This invention will be described in greater detail with reference to the drawings in which 

s Fig. 1 is a restriction map of hunnan liver liag cDNA; 

Fig. 2 (figs 2a, 2b, 2c, 2d) is the nucleotide sequence of hunnan liver hap cDNA and a predicted amino acid sequence 
of human liver hap cDNA; 

^0 Fig. 3 depicts the distribution of hag mRNA In different tissues as determined by Northern blot analysis; 

Fig. 4 depicts the distribution of hag mRNA in HCC and HCC derived cell lines as determined by Northern blot 
analysis; 

^5 Fig. 5 is a fluorograph of hap polypeptide synthesized in vitro and isolated on SDS-polyacrylamide gel; 

Fig. 6 shows the alignment of hap translated amino acid sequence with several known sequences for thyroid and 
steroid hormone receptors; 

20 Fig. 7 is a schematic alignment of similar regions identified as A/B, C, D. and E of the amino acid sequences of Fig. 6; 

Fig. 8 depicts hag related genes in vertebrates (A) and in humans (B and C) as determined by Southern blot 
analysis; 

25 Fig. 9 shows the tissue distribution of RAR a and p transcripts; 

Fig. 10 shows the dose- and time-response of RAR a and p transcripts after retinoic acid treatment of PLC/PRF/ 
5 cells; 

30 Fig. 11 shows the effect of RNA and protein synthesis inhibitors on the levels of RAR a and p mRNAs; 

Fig. 12 reports the results of nuclear run-on analysis of RAR p gene transcription after RA treatment; and 
Fig. 13 reports the results of nuclear run-on analysis of RAR p transcription in two hepatoma cell-lines; 

35 

Fig. 14 shows the resulting kinetic analysis of RAR mRNA degradation; 

Fig. 15 depicts a nucleotide sequence analysis extending a X 13 RAR-p by 72 bp; and 

40 Fig. 16 is a complete restriction map of a cloned Hindi II -BamHl genomic DNA insert containing the nucleotide 

sequence of Fig. 15. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

45 A. IDENTIFICATION OF A PROTEIN. NAMED liap PROTEIN. HAVING DNA-BINDING AND LIGAND-BINDING 
DOMAINS, AND IDENTIFICATION OF THE DNA SEQUENCE ENCODING hap PROTEIN 

As previously noted, ligand-dependent transcriptional activators, such as steroid or thyroid hormone receptors, 
have recently been cloned. The primary structure and expression of a new gene, hag, closely related to steroid or 
50 thyroid hormone receptor genes have now been discoverd. The hag product exhibits two regions highly homologous 
to the consen/ed DNA- and hormone-binding domains of previously cloned receptors. 

More particularly, the cloning of a cDNA corresponding to a novel steroidAhyroid hormone receptor-related gene 
has been achieved. The cDNA was recovered from a human liver cDNA library using a labelled cellular DNA fragment 
previously isolated from a liver tumor. The fragment contained a 147 bp putative exon in which HBV inserted. The 
55 sequence of this cellular gene, which Is referred to herein as hag for hepatoma, reveals various structural features 
characteristic of c-erbA /steroid receptors (Dejean et al., 1986). The receptor-related protein is likely to be a novel 
member of the superfamily of transcriptional regulatory proteins that includes the thyroid and steroid hormone receptors. 

It has been discovered that the hap gene is transcribed at low level in most human tissues, but the gene is over- 
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expressed in prostate and kidney. Moreover, six out of seven hepatoma and hepatonna-derived cell lines express a 
. small hap transcript, which is undetectable in normal adult and fetal livers, but is present in ali non-hepatic tissues 
tested. Altered expression of hag may be involved In liver oncogenesis. 

These findings, as well as other discoveries relating to this invention, will now be described in detail. 

5 

A.I Cloning and Sequencing of a Hap cDNA 

A human liver cDNA library was screened using a nick-translated 350 bp EcoRI genomic fragment (MNT probe) 
previously cloned from a hepatoma sample. The fragment contained the putative 147 bp cellular exon in which HBV 
10 integration took place (Dejean at al., 1986). 

Four positive 3' co-terminal clones were isolated from the 2 x 10® plaques screened and the restriction maps were 
deduced for each of the cDNA clone EcoRI inserts. The longest one was identified lambda-1 3. The restriction map of 
lambda-13 is shown in Fig. 1 . 

Referring to Fig. 1 , the insert of clone lambda-1 3 is nearly a full-length cDNA for the hag gene. Noncoding se- 
15 quences (lines) and coding sequences (boxed portion) are indicated. Restriction sites are: 
R EcoRI 
Bg Bglll 
M Mael 
XXhol 
20 K Kpni 

P Pvull 
8 BamHI 
H Hindlll. 

The lambda-1 3 clone was subjected to nucleotide sequence analysis. The nucleotide sequence is shown in Fig. 
25 2. The nucleotide sequence of the hag cDNA is presented In the 5' to 3' orientation. The numbers on the right refer to 
the position of the nucleotides. Numbers above the deduced translated sequence indicate amino acid residues. The 
four short open reading frames in the 5' untranslated region are underlined. Adenosine residues (20) are found at the 
3' end of lambda-1 3. The putative potyadenylation signal site (AATAAA) is boxed. The region homologous to the DNA- 
binding domain of known thyroid/steroid hormone receptors is indicated by horizontal arrows. The exon, previously 
30 cloned from a HCC sample genomic DNA library and in which HBV integration took place, is bracketed. 

This invention of course includes variants of the nucleotide sequence shown in Fig. 2 encoding hap protein or a 
serotypic variant of hag protein exhibiting the same immunological reactivity as hag protein. 

The DNA sequence of the invention is in a purified form. Generally, the DNA sequence is free of human serum 
proteins, viral proteins, and nucleotide sequences encoding these proteins. The DNA sequence of the invention can 
35 also be free of human tissue. 

The DNA sequence of the invention can be used as a probe for the detection of a nucleotide sequence in a biological 
material, such as tissue or body fluids. The polynucleotide probe can be labeled with an atom or inorganic radical, 
most commonly using a radionuclide, but also perhaps with a heavy metal. 

In some situations it is feasible to employ an antibody which will bind specifically to the probe hybridized to a single 
40 stranded DNA or RNA. In this instance, the antibody can be labeled to allow for detection. The same types of labels 
which are used for the probe can also be bound to the antibody in accordance with known techniques. 

Conveniently, a radioactive label can be employed. Radioactive labels include ^^p^ ^H, ^^C, or the like. Any radi- 
oactive label can be employed, which provides for an adequate signal and has sufficient half-life. Other labels include 
ligands, that can serve as a specific binding member to a labeled antibody, fluorescers, chemiiuminescers, enzymes, 
45 antibodies which can sen/e as a specific binding pair member for a labeled ligand. and the like. The choice of the label 
will be governed by the effect of the label on the rate of hybridization and binding of the probe to the DNA or RNA. It 
will be necessary that the label provide sufficient sensitivity to detect the amount of DNA or RNA available for hybrid- 
ization. 

Ligands and anti-ligands can be varied widely. Where a ligand has a natural receptor, namely ligands such as 
50 biotin, thyroxine, and Cortisol, these ligands can be used in conjunction with labeled naturally occurring receptors. 
Altemalively, any compound can be used, either haptenic or antigenic, in combinations with an antibody. 

Enzymes of interest as labels are hydrolases, particularly esterases and glycosidases, or oxidoreductases, par- 
ticularly peroxidases. Fluorescent compounds include fluorescein and its derivatives, rhodamine and its derivatives, 
dansyl, umbelliferone, etc. Chemiiuminescers Include luciferin and luminol. 

55 

A. 2. Amino Acid Sequence of Protein Encoded by hap Gene 

Based upon the sequence of the hap cDNA, the amino acid sequence of the protein encoded by hap gene was 
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determined. With reference to Fig. 2, the deduced amino acid sequence encoded by the gene reveals a long open 
reading frame of 448 amino acids corresponding to a predicted polypeptide of relative molecular mass 51 ,000. 

A putative initiator methionine codon and an in-frame terminator codon are positioned respectively at nucleotides 
322 and 1 666 in the sequence (Fig. 2). Despite that two other methionine codons are found 4 and 26 triplets downstream 
from the first ATG, the first one is the initiation codon of translation. 

The coding sequence is preceded by a 5' region of at least 321 nucleotides which contains four short open reading 
frames delineated by initiator and stop codons (Fig. 2). Translation usually starts, in eukaryotes, at the 5' most ATG 
triplet, but the finding of open reading frames in the 5' 'untranslated' region is not unprecedented (Kozak, 1986). It is 
not known yet whether those sequences are used for translation and exert any function in the cell. 

In the 3' untranslated region, 1326 nucleotides long, no long open reading frame is present. A putative polyade- 
nylation signal (AATAAA) is found 19 bp upstream from the polyadenylation site. 

It will be understood that the present invention is intended to encompass the protein encoded by the ha2 gene, i. 
e. hag protein, and fragments thereof in highly purified form. The hag protein can be expressed in a suitable host 
containing the DNA sequence of the invention. This invention also includes polypeptides in which all or a portion of 
the binding site of ha£ protein is linked to a larger carrier molecule, such as a polypeptide or a protein, and in which 
the resulting product exhibits specific binding in vivo and in vitro . In this case, the polypeptide can be smaller or larger 
than the proteinaceous binding site of the protein of the invention. 

It will be understood that the polypeptide of the invention encompasses molecules having equivalent peptide se- 
quences. By this it is meant that peptide sequences need not be identical. N^riations can be attributable to local mu- 
tations involving one or more amino acids not substantially affecting the binding capacity of the polypeptide. Variations 
can also be attributable to structural modifications that do not substantially affect binding capacity. Thus, for example, 
this invention is intended to cover serotypic variants of hap protein. 

Three particular regions of hag gene are of interest. Two of them are located in the D region (amino acids comprised 
between 46 and 196), which have been shown by the inventors to be highly immunogenic. Amino acids 46-196 have 
the sequence: 

X3lnHisArgHisThrAlaGlnSerrieGluThrGlnSerThrS€rSerGluGlu 

LeuValProSerProProSerProLeuProProProArgValTyrLysProCysPheValC/s 

GlnAspLysSerSerGlyTyrHisTyrGlyValSerAlaCysGluGlyCysLysGlyPhePhe 

ArgArgSerlleGlnLysAsnMetlleTyrThrCysHisArgAspLysAsnCysVallleAsn 

LysValThrArgAsrtArgCysGlnTyrCysArgLeuGlnLysCysPheGluValGlyMetSer 

LysGluSerValArgAsnAspArgAsnLysLysLysLysCluThrSerLysGLnGluCysThr 

GluSerTyrGluMetThrMaGluLeuAspAspLeuThrGluLysIleArgLysAlaHisGln 

CluThrPheProSerLeuCys . 

One peptide of interest in the D region is comprised of acids 151-167 and has the sequence: 

ValArgAsnAspAsgAsnLysLysLysLysGluThrSerLysGlnGluCys . 

A second peptide in the D region is located between amino acids 175 and 185. This peptide has the amino acid 
sequence: 

AlaGluLeuAspAspLeuThrGluLysIleArg. 

Another peptide of interest is located at the end of C region between amino acids 440 and 448. This peptide has 
the amino acid sequence: 

GlyValSerGlnSerProLeuValGIn. 

other peptides having formulas derived from the nucleotide sequence of hap gene can be used as reagents. 
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particularfy to obtain antibodies for diagnostic purposes, as defined hereinabove. 

The most favorable region is found in the hinge region (annino acids 147 to 193). This region includes amino acids 
150 to 1 70. corresponding to the following criteria: 

The region includes very hydrophilic sequences, namely the sequences 154-160 (No. 1/Hopp); 155-161 (No. 
1/Doolittle); 155-159 (No. 1/acrophilic). 

The region includes a peptide, namely, amino acids 156-162, No. 5 in mobility. 
- The polypeptide of this region has a low probability of adopting a structure in the form of a folded sheet or a helix, 
but, in contrast, a good probability of an omega loop and one beta-turn, very marked in the Asp-Arg-Asn-Lys 
tetrapeptide. 

The region does not have a potential site of N-glycosy!ation nearby; several suggestions in this zone can be made: 

Val-Arg-Asn-Asp'Arg-A3f\-Lys-Lys-Lys-Lys-Glu-Thr-Ser-Lys- 
Gln-Glu-Cys (peptide I); 

Peptide 1 corresponds to amino acids 151-167 and permits finding Cys 167, which is present in the sequence and 
enables attachment to a carrier (It will be noted that this peptide corresponds to a consensus sequence of phos- 
phorylation by kinase A). 

Peptide 1 can be shortened by N-turn while preserving the beta-tum and by C-tum while replacing Ser by Cys to 
maintain the possibility of coupling at this level: 

Asn-Asp-Arg-Asn-Lys-Lys-Lys-Lys-Glu-Thr-Cys (peptide 2). 

Peptide 2 is also favorable, but Is clearly less favorable than Peptide 1 from the viewpoint of hydrophilicity as of its 
higher potential for spatial organization (probably as amphlphilic helix). 

Finally, it will be noted that the C-terminal end constitutes a preferred region as a function of its mobility, but it 
nevertheless remains very hydrophobic. For example, the following peptide is contemplated: 

Cys-Gly-Val-Ser-Gln-Ser-Pro-Leu-Val-Gln (peptide 3), 

Peptide 3 can be fixed in a specific manner by an N-terminal Cys in such a way as to reproduce its aspect on the protein. 
The nucleotide sequences of hap gene encoding those peptides are as follows: 

For peptide 1 : 

GTCAGGAATGACAGGAACAACAAAAAGAAGGAGACTTCCAAGCAAGAATGC. 

For peptide 2: 

GGGGTCACTCAGTCACCACTCGTGCAA . 

For peptide 3: 

AATGACAGGAACAAGAAAAAGAACCAGACT * 

For peptide of amino acids 175-185: 

GCTGAGTTGGACCATCTCACAGAGAAGATTCCGA. 

The polypeptides of the invention can be Injected in mice, and monoclonal and polyclonal antibodies can be ob- 
tained. Classical methods can be used for the preparation of hybridomas. The antibodies can be used to quantify the 
amount of human receptors produced by patients in order to correlate the pathological states of illness and quantity 
of receptors or the absence of such receptors. 

Epitope-bearing polypeptides, particularly those whose N-terminal and C-terminal amino acids are free, are ac- 
cessible by chemical synthesis using techniques well known in the chemistry of proteins. For example, the synthesis 
of peptides In homogeneous solution and in solid phase is well Known. 

In this respect, recourse may be had to the solid phase synthesis of peptides using the method of Merrlfield, J. 
Am. Chem. Assoc. 85, 2149-2154 (1964) or the method of synthesis in homogeneous solution described by Houben- 
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weyl in the work entitled "Methoden der Organischen Chemie" (Methods of Organic Chemistry), edited by E. WUNSCH, 
vol. 15-1 and II, THIEME, Stuttgart (1974). 

This method of synthesis consists of successively condensing either the successive amino acid in pairs in the 
appropriate order, or successive peptide fragments previously available or formed and containing already several ami- 

5 noacyl residues in the appropriate order, respectively. Except for the carboxyl and amino groups which will be engaged 
in the formation of the peptide bonds, care must be taken to protect beforehand all other reactive groups borne by 
these aminoacyl groups and fragments. However, prior to the formation of the peptide bonds, the carboxyl groups are 
advantageously activated according to methods well known in the synthesis of peptides. Alternatively, recourse may 
be had to coupling reactions bringing into play conventional coupling reagents, for instance of the carbodiimide type 

^0 such as 1-ethyl-3-(3-dimethyl-aminopropyl)-carbodiimide. When the amino acid group carries an additional amino 
group (e.g. lysine) or another acid function (e.g. glutamic acid), these groups may be protected by carbobenzoxy or t- 
butyloxycarbonyl groups, as regards the amino groups, or by t-butylester groups, as regards the carboxylic groups. 
Similar procedures are available for the protection of other reactive groups. For example, SH group (e.g. in cysteine) 
can be protected by an acetamidomethyl or paramethoxybenzyl group). 

'5 In the case of progressive synthesis, amino acid by amino acid, the synthesis preferably starts by the condensation 

of the C-terminal amino acid with the amino acid which corresponds to the neighboring aminoacyl group in the desired 
sequence and so on, step by step, up to the N-terminal amino acid. Another preferred technique that can be relied 
upon is that described by R.D. Merrifield in "Solid Phase Peptide Synthesis' (J. Am. Chem. Soc., 45, 2149-2154). In 
accordance with the Merrifield process, the first C-terminal amino acid of the chain is fixed to a suitable porous polymeric 

20 resin by means of its carboxylic group, the amino group of said amino acid then being protected, for example, by a t- 
butyloxycarbonyl group. 

When the first C-terminal amino acid is thus fixed to the resin, the protective group of the amino group Is removed 
by washing the resin with an acid, i.e. trifluoroacetic acid when the protective group of the amino group is a t-butyloxy- 
carbonyl group. 

25 Then the carboxylic group of the second amino acid, which is to provide the second aminoacyl group of the desired 

peptide sequence, is coupled to the deprotected amino group of the C-terminal amino acid fixed to the resin. Preferably, 
the carboxyl group of this second amino acid has been activated, for example by dicyclohexylcarbodlimide, while its 
amino group has been protected, for example by a t-butyloxycarbonyl group. The first part of the desired peptide chain, 
which comprises the first two amino acids, is thus obtained. As previously, the amino group Is then deprotected, and 

30 one can further proceed with the fixing of the next aminoacyl group and so forth until the whole peptide sought is 
obtained. 

The protective groups of the different side groups, if any. of the peptide chain so formed can then be removed. 
The peptide sought can then be detached from the resin, for example, by means of hydrofluoric acid, and finally re- 
covered in pure form from the acid solution according to conventional procedures. 
35 Depending on the use to be made of the proteins of the invention, it may be desirable to label the proteins. Examples 

of suitable labels are radioactive labels, enzymatic labels, flourescent labels, chemiluminescent labels, or chromo- 
phores. The methods for labeling proteins of the Invention do not differ in essence from those widely used for labeling 
immunoglobulin. 

40 A. 3. Tissue Specific mRNA Distribution 

In order to study expression of the hap gene, Northern blot analysis was performed using MNT as a probe and 
poly(A)+ RNA extracted from various human tissues and cell lines. The results are shown in Figure 3. 

More particularly, Northern blot analyses were performed with poly(A)+ RNAs (15 ug per lane) extracted from 
45 different human organs and cell lines. A control hybridization with a mouse beta-actin cDNA probe is shown below the 
hybridizations in Fig. 3. Hap mRNA in different tissues Is shown in Fig. 4A as follows: 
Lane a ovary 
t_ane b uterus 

Lane c HBL 100 mammary cells 
so Lane d adult spleen 

Lane e 18 weeks fetal spleen 
Lane f K562 

Lane g HL60 hematopoeitic cell lines 
Lane h prostatic adenoma 
55 Lane I kidney 

Lane j adult liver 

Lane k 18 weeks fetal liver. 

Lanes a-k correspond to a one day exposure. 



10 



EP 0 321 362 B1 



Fig. 3 shows that two RNA species of 3 kb and +2.5 kb (the size of this smaller mRNA is slightly variable from one 
organ to another) were expressed at low abundance in ovary (lane a), uterus (lane b), HBL 100 mammary cells (lane 
c). adult and fetal spleen (lane d and e, respectively), and K562 and HL60 hematopoeitic cell lines (lanes f and g, 
respectively). Surprisingly, an approximately tenfold higher level of expression was detected in prostatic adenoma (lane 
5 h) and kidney (lane i). By contrast, a single mRNA of 3000 nucleotides, expressed at low levels, was present in poly 
(A)+ RNA from adult and fetal liver tissues (lanes j and k). Therefore, the cloned hag cDNA is likely to be a full-length 
copy of this transcript. 

The finding of two mRNA species overexpressed in prostate and kidney as well as the presence of a single mRNA 
expressed at low level in adult and fetal livers shown that hag expression is differentially regulated in those organs. 
10 This tissue specific expression provides some indication that prostate and kidney as well as liver, could be key tissues 
and that hag functions in those cell types may differ. 

Fig. 4 shows hap mRNA in HCC and HCC derived cell-lines as follows: 

Lane a, normal liver (four days autoradiography); 
15 Lanes b, c, d; three HCC samples (Lane b, patient Ca; Lane c, patient Mo; Lane d, patient TC1); 

Lanes e. f, g: three HCC-derived cell lines (Lane e, PLC/PRF/5; Lane f, HEPG2; Lane g, HEP 38). 

The lanes b-g correspond to a one day exposure. Once again, a control hybridization with a normal beta-actin cDNA 
probe Is shown below the hybridizations. 

20 With reference to Fig. 4. the smaller 2.5 kb mRNA was undetectable, even after long exposure, in three adult and 

two fetal human livers analyzed (Fig. 4, Lane a). This differential expression in normal livers may suggest a distinct 
role of hap in this particular tissue. 

Northern blot analysis of human HCCs and hepatoma cell lines showed almost constant alterations in hag tran- 
scription. There are two possible alternatives to explain this result. The smaller mRNA species can be simply expressed 

25 as a consequence of the cellular dedifferentiation. The tumorous liver cell, having lost its differentiated characteristics, 
would behave as any other cell type and thus express the same 2.5 kb mRNA as found in non-hepatic cells. However, 
the inability to detect such a smaller transcript in fetal livers does not seem to favor this hypothesis. On the contrary, 
the presence of the smaller transcript may have preceded the tumorigenesis events and would rather reflect a prene- 
oplastic state. The presence of an inappropriately expressed hag protein, normally absent from normal hepatocytes, 

30 may have directly participated to the hepatocellular transformation. In this respect, the previous study reporting a HBV 
integration In the hag gene of a human HCC (Dejean et al., 1986) strongly supports the idea that hag could be caus- 
atively involved in liver oncogenesis. Indeed, in this tumor, a chimeric gene between the viral pre-SI gene and hap may 
have resulted in the over-expression of a truncated hag protein. At present, it is the one found in non-hepatic tissues. 

35 A.4. Expression of hap in Hepatocellular Carcinoma 

Hap was first identified in a human primary liver cancer. Encouraged by this finding, poly(A)+ RNA from seven 
hepatoma and hepatoma-de rived cell lines were analyzed by Northern-blotting. Five of them contained integrated HBV 
DNA sequences. In addition to the 3 kb long mRNA found in normal adult and fetal liver, an additional +2.5 kb RNA 

40 species was observed, in equal or even greater amount, in three out of four HCC (Fig. 4, Lanes b, c, d) and in the PLC/ 
PRF/5, HEPG2 and HEP38 hepatoma cell-lines (Lanes e, f, g). The size of the smaller transcript was variable from 
sample to sample. In addition, the two transcripts were strikingly overexpressed, at least ten fold, in the PLC/PRF/5 cells. 

To lest the possibility that the inappropriate expression of hap in those six tumors and tumorous cell-lines might 
be the consequence of a genomic DNA alteration, Southern-blotting of cellular DNA was performed using, as two 

45 probes, the MNT fragment together with a 1 kb EcoRI fragment corresponding to the 5' extremity of the cDNA insert 
(Fig. 2). No rearrangement and/or amplification was detected with any of these two probes which detect a different 
single exon (data not shown), suggesting that the hag gene was not altered at the genomic level. It is yet unknown 
whether the +2.5 kb mRNA, present in the liver tumorous samples and cell lines, corresponds to the same smaller 
transcript as that found in non-hepatic tissues. However, its presence in the liver seems to be clearly associated to the 

50 hepatocellular transformed state. 

A.5. Hormone-binding Assay 

Amino-acid homologies between the hap protein and the c-erbA/ steroid receptors support the hypothesis that hap 
55 may be a receptor for a thyroid/steroid hormone-related ligand. The ability to express functional receptors in vitro from 
cloned c-erb A /steroid receptor genes led to the use of an in vitro translation assay to identify a putative hag ligand. 

The coding region of hag was cloned into pTZ18 plasmid vector to allow in vitro transcription with the T7 RNA 
polymerase and subsequent translation in reticulocyte lysales. The results are shown in Fig. 5. More particularly ^^s- 
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methionine-labelled products synthesized using T7 polymerase-cataiysed RNA transcripts were separated on a 12% 
SDS-polyacrylamide gel. whicin was fluorographed (DMSO-PPO). The lanes in Fig. 5 are as follows: 

Lane a, pCOD 20 (sense RNA, 70 ng) 
5 Laneb, PCOD 20 (140 ng) 

Lane c, pCOD 14 (antisense RNA. 140 ng). 

Figure 5 shows that the hag RNA directed the efficient synthesis of a major protein, with a 51 K relative molecular 
mass, consistent with the size predicted by the amino acid sequence (lanes a and b), whereas the anti-sense RNA- 

^0 programmed lysate gave negligible incorporation (lane c). 

Because c-erbA and hap colocalize on chromosome 3 and are more closely related according to their amino acid 
sequence, (i25j).T3 (triiodothyronine), -reverse T3 (3.3' ,5'-triiodo-thyronine) and -T4 (thyroxine), were first tested for 
their binding with the in vitro translated hap polypeptide. No specific fixation with any of those three thyroid hormones 
could be detected. As a positive control, binding of a T3 was detected with nuclear extracts from HeLa cells. The results 

15 were negative as well when the experiment was repeated with (3H)-retinol, -retinoic acid, and -testosterone, which 
represent three putative iigands for hag whose receptors have not yet been cloned. Although it cannot excluded that 
hap may encode a hormone independent transcriptional activator, it is more likely that hag product, i.e. the hag protein, 
is a receptor for a presently unidentified hormone. 

20 A.6. Similarity of HAP Protein to Thyroid/Steroid Hormone Receptors 

The c-erbA gene product, recently identified as a receptor for thyroid hormone (Weinberger, et al., 1986; Sap et 
aL, 1986), as well as the steroid receptors, belong to a superfamily of regulatory proteins, which consequently to their 
binding with specific ligand, appear capable of activating the transcription of target genes (reviewed by Yamamoto. 
25 1 985). This activation seems to be the result of a specific binding of the hormone-receptor complex to high-affinity sites 
on chromatin. 

Comparative sequence analysis has been made between the following different cloned steroid receptors: 
glucocorticoid receptor (GR) (Hollenberg et al., 1985; Miesfeld et al.. 1986); 
oestrogen receptor (ER) (Green et al., 1986; Greene et al., 1986); 
^0 progesterone receptor (PR) (Conneely et al., 1 986; Loosfelt et al.. 1 986); and 

thyroid hormone receptor (c-erbA product) (Weinberger et al., 1986; Sap et al., 1986). 
Mutation analysis has also been carried out. (Kumar et al., 1986; Hollenberg et al., 1987; Miesfeld et al.. 1987). The 
results revealed the presence of two conserved regions representing the putative DNA-binding and hormone-binding 
domains of those molecules. It has now been discovered that hap protein is homologous to the thyroid/steroid hormone 
35 receptors. 

More particularly, homology previously reported between the putative 147 bp cellular exon (bracketed in Fig. 2) 
and the c-erbA /steroid receptor genes led us to compare the entire hag predicted amino acid sequence with hGR, rPR, 
hER, and hc-erbA/ thyroid hormone receptor. The five sequences have been aligned for maximal homology by the 
introduction of gaps. The results are depicted in Fig. 6. Specifically, the following nucleotide sequences were aligned 
40 after a computer alignment of pairs (Wilbur and Lipman, 1983): 
hap product, 

human placenta c-erbA protein (hc-erbA , Weinberger et al., 1986), 
human oestrogen receptor (hER, Green et al., 1986). 
rabbit progesterone receptor (rPR, Loosfelt et al., 1986), and 
45 human glucocorticoid receptor (hGR, Hollenberg et al., 1985). 

A minimal number of gaps (-) was introduced in the alignment. 

Amino acid residues matched in at least three of the polypeptides are boxed in Figure 6. The codes for amino 
acids are: 
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A 


Ala 


Alanine 


C 


Cys 


Cysteine 


D 


Asp 


Aspartic Acid 


E 


Glu 


Glutamic Acid 


F 


Phe 


Phenylalanine 


G 


Gly 


Glycine 


H 


His 


Histidine 
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(continued) 



JO 



15 



1 


lie 


Isoieucine 


K 


Lys 


Lysine 


L 


Leu 


Leucine 


M 


Met 


Methionine 


N 


Asn 


Asparagine 


P 


Pro 


Proline 


Q 


Gin 


Glutamine 


R 


Arg 


Arginine 


S 


Ser 


Serine 


T 


Thr 


Threonine 


V 


Val 


Valine 


W 


Trp 


Tyrptophan 


Y 


Tyr 


Tyrosine 
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25 



30 



35 



40 



45 



SO 
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The sequence comparison analysis revealed that the two regions highly conserved in the thyroid/steroid hornnone 
receptors are similarly conserved in the hag product. Consequently, the overall organization of hag is much similar to 
that of the four receptors in that it can be roughly divided into four regions (arbitrarily referred to as A/B, C, D and E 
(Krust et al.. 1986)). 

In C, the most highly conserved region, extending from amino-acid 81 to 146 in hag, the nine cysteines already 
conserved between the four known receptors are strikingly present at the same positions. Comparison between the 
cysteine-rich region of hap with the corresponding region of the four receptors reveals 64% amino acid identity with 
hc-erbA . 59% with hER, 42% with rPR and 44% with hGR. This is schematically represented in Fig. 7. 

Referring to Fig. 7, a schematic alignment of the five proteins can be seen. The division of the thyroid/steroid 
hormone receptor regions A/B, C, D, E is schematically represented in the hag protein. The two highly conserved 
regions, identified as the putative DNA-binding (region C) and hormone-binding (region E) domains of the receptors, 
are shov\m as stippled blocks. The numbers refer to the position of amino acid residues. The sequences of each of the 
hc-erbA product, hER, rPR and hGR receptors are compared with the hap protein. The numbers present in the stippled 
blocks correspond to the percentage of homology between hap protein on the one hand and each of the receptors on 
the other hand in the two highly conserved regions C and E. The empty blocks correspond to the non-conserved A/B 
and D regions. 

It has also been found that hag shares 47% homology in the C region with the chicken vitamin D3 receptor (VDR), 
recently cloned as a partial cDNA (McDonnel et al, 1987) (data not shown). Apart from c-erbA . which contains two 
additional residues, the 66 amino acid long C region shows a constant length in hER, VDR, hGR, rPR and hag se- 
quences. 

Region E (residue 195-448), which is well-conserved, but to a lesser extent, shows a slightly stronger homology 
to hc-erbA (38%) (Fig. 7). The hap/hc-erbA homology, however, remains inferior to the identity found between hGR 
and rPR (90 and 51 per cent in regions C and E, respectively). No significant homology was observed when comparing 
the A/B (residue 1-80) and D (147-194) regions which are similarly variable, both in sequence and length, in the four 
known receptors. 

It is thus evident from Figs. 6 and 7 that the hag product exhibits two highly homologous regions. The C domain 
is characterized by strikingly conserved Cys-X2-Cys units, evoking those found in the DNA-binding transcriptional 
factor TFIIIA (Miller et al., 1985) and in some protein that regulated development, such as Kruppel (Rosenberg et al., 
1986). In the latter, the Cys-X2-Cys, together with His-X3-His units, can form metal binding fingers that are crucial for 
DNA-binding (Berg, 1986; Diakun et al.. 1986). Similarly the C domain of previously cloned receptors are likety to 
contain metal binding fingers and were shown to bind DNA (Hollenberg et al., 1987; Miesfeld et al., 1987). Since the 
C region of the hap gene product shares 24/66 conserved amino acids with all all steroid or thyroid hormone receptors, 
including all nine cysteine residues, it is likely that the hap protein is a DNA-binding protein. Hap , as c-erb/V steroid 
receptors, may modulate the transcription of target genes. 

In addition, the the significant homology detected in the E domain suggests that hap product is a Itgand-bindtng 
protein and directs the question of the nature of the putative ligand. Hag protein seems to differ too much from previously 
cloned hormone receptors to be a variant of one of them. In addition, the in vitro translated 51 K hap polypeptide failed 
to bind all ligands tested. Although that hag gene product could be a ligand-independent DNA-binding protein, it is 
believed that hap encodes a receptor for a presently unidentified circulating or intracellular ligand. 

It has been proposed that steroid and thyroid hormone receptor genes were derived from a common ancestor 
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(Green and Chambon, 1 986). This primordial gene may have provided to the receptors their common scaffolding while 
the hormone and target gene cellular DNA specificities were acquired through mutations accumulated in the C and E 
domains. Hap is both linked to the steroid receptor gene by its shorter C domain (66AA) and to the thyroid hormone 
receptor genes by its clearly greater homology with c-erbA in the E region (38%). This suggests that hap llgand may 

5 belong to a different hormone family. 

Different functions have been assigned to the four regions defined in the glucocorticoid and oestrogen receptors 
(Kumar et al., 1986; Giguere et al., 1986; Miesfeld et al., 1987). By analogy, the regions C and E may represent, 
respectively, the putative DNA-bindIng and hormone-binding domains of the ha^ protein. The precise functions of the 
A/B and D domains remain unknown. The presence of the amino-terminal A/B region of the human GR has been 

10 recently shown to be necessary for full transcriptional activity (Hollenberg et al., 1 987), whereas results obtained with 
the rat GR indicated it was dispensable (Miesfeld et al., 1987). From this alignment study it appears that hap is distinct, 
but closely related to the thyroid/steroid hormone receptor genes suggesting that its product may be novel ligand- 
dependent, DNA-binding protein. 

'5 A. 7. Hap related genes 

Southern blotting was performed on restriction enzyme-digested DNAs obtained from different organisms with 
labelled genomic MNT fragment containing the first exon of the cysteine-rich region of hag. The results are shown in 
Fig. 8. More particularly, hag related genes in vertebrates (A) and in humans (B and C) were compared. Cellular DNA 
20 (20 ug) from various sources was digested with Bglll and subjected to Southern blot analysis using the MNT probe 
under non-stringent hybridization and washing conditions. The lanes in Fig. 8A are identified as follows: 
Lane a human liver 
Lane b domestic dog liver 
Lane c woodchuck (marmota monax) 
2S Lane d mouse liver (BALB/c strain) 

Lane e chicken erythrocytes 
Lane f cartilaginous fish (Torpedo). 
As illustrated in Fig. 8A, Bajll fragments that anneal effectively with MNT probe under non-stringent hybridization 
and washing conditions are present in digests of DNA from several mammals (mouse,' woodchuck, dog) as well as 
30 from bird and fish. If this blotting experiment is performed at high stringency, no hybridization is observed with heter- 
ologous DNA (data not shown). These data suggest that the hybridizing sequences represent evolutionarily conserved 
homologs of hap . 

The existence of multiple c-erbA and GR genes (Jansson et al., 1983; Weinberger et al., 1986; Hollenberg etal.. 
1985) encouraged a search for hag related genes in the human genome. Thus, human liver DNA digested by .Pstl. 

3S Bam HI. and Eco RI was analyzed by Southern blot, using the MNT probe, under stringent conditions. The results are 
shown in Fig. 88. After digestion of liver DNA by PstI (lane a), Bam HI (lane b) or EcoRl (lane c) a single band is 
obsen/ed with the MNT probe in high stringency hybridization. 

The same blot was hybridized with the MNT probe under non-stringent hybridization and washing conditions. The 
results are shown in Fig. 8C. When Southern blotting was performed under relaxed hybridization conditions, additional 

40 bands were observed in the products of each enzyme digestion (Fig. 8C, lanes a, b, c). For example, seven faint 
hybridizing fragments of 1, 1.7, 2.4, 3.8, 5.5, 6, 7.4 kb were observed in the Bam HI digestion (lane b). None of those 
bands cross-hybridized with a human c-erbA probe (data not shown). A minimum of three faint bands in the PstI lane 
suggests the existence of at least four related hap genes in the human genome. . 

From a panel of somatic cell hybrids, hag was assigned to chromosome 3 (Dejean et al.. 1 986). To find out whether 

4S the hag related genes were all chromosomally linked or not, DNAs from human liver LA.56U and 53K cell-lines (two 
mouse/human somatic cell hybrids containing, altogether, most human chromosomes except chromosome 3 (Nguyen 
Van Cong et al., 1986)), and mouse lymphoid cells were Bam HI digested, transferred to nitrocellulose, and hybridized 
to the MNT probe in low-stringency conditions. Of the seven faint bands present in the human liver DNA track, two at 
least were conserved in the LA.56U and/or L.53K cell lines DNAs digestion (data not shown) indicating that some of 

50 the hag genes do not localize on chromosome 3. Altogether the results suggest that hag belongs to a multigene family 
consisting of at least four members dispersed in the human genome. 

The experimental procedures used in carrying out this invention will now be described in greater detail. 

A.8. EXPERIMENTAL PROCEDURES 

55 

A. 8. 1 . cDNA Cloning and Screening 

Briefly, the cDNA was synthesized using oligo dT primed poly-A+ liver mRNA, using the method of Gubler and 



14 



EP 0 321 362 B1 



Hoffman (1983) (C. de Taisne, unpublished data). cDNA's were size selected on a sucrose gradient and the fraction 
corresponding to a mean size of 3 kb was treated with EcoRI methylase. After addition of EcoRl !inkers,the cDNA was 
digested by EcoRI and ligated to an EcoRI restricted lambda-NM1U9. After in vitro encapsldatlon, the phages were 
amplified on C600 hfl and 2. 10^ recombinant were plated at a density of 10,000 per dish. The dishes were transfered 
5 to nylon filters and hybridized to the 350 bp EcoRI -EcoRI genomic fragment (MNT) previously described (Dejean et 
al.. 1986). Four positive clones were Isolated and the restriction map of each insert was determined. The longest one, 
clone lambda-13, was subjected to nucleotide sequence analysis. 

A. 8. 2. Nucleotide Sequence 

10 

Clone lambda-13 DNA was sonicated, treated with the Klenow fragment of DNA polymerase plus deoxyribonucle- 
otides (2h, 15°C) and fractionated by agarose gel electrophoresis. Fragments of 400-700 bp were excised and elec- 
troeluted. DNA was ethanol-precipitated, ligated to dephosphorylated Sma1 cleaved Ml 3 mpB replication form DNA 
and transfected into Excherichia coll strain TG-1 by the high-efficiency technique of Hanahan (1983). Recombinant 
15 clones were detected by plaque hybridization using either of the four EcoRI fragments of cDNA insert as probes (Fig. 
1). Single-stranded templates were prepared from plaques exhibiting positive hybridization signals and were se- 
quenced by the dideoxy chain termination procedure (Sanger et al. , 1 977) using buffer gradient gels (Biggin et a!., 1 983). 

A.8.3. Northern Blot 

20 

Cytoplasmic RNA was isolated from the fresh tissue using guanidine thiocyanate, and the RNA cell line was ex- 
tracted using isotonic buffer and 0.5% SDS, 10 mn Na acetate pH 5.2. RNAs were then treated with hot phenol. Poly 
(A)+ RNA (1 5 ^g) of the different samples were separated on a 1 % agarose gel containing glyoxal. transfered to nylon 
filters and probed using the nick-translated MNT fragment. The experimental procedure is described in Maniatis et al. 
25 (1982). 

A.8.4. Southern Blot 

20 ug of genomic DNA was digested to completion, fractionated on a 0.8% agarose gel and transfered to nylon 
30 paper. Low stringency hybridization was performed as follows: 24 h prehybridization in 35% formamlde, 5x Denhardt, 
5x SSC, 300 ug/ml denatured salmon sperm DNA, at 40*0; 48 h hybridization with 35% formamlde, 5x Denhardt, 5x 
SSC, 10% Dextran sulfate, 2.10® cpnn/ml denatured ^^p labelled DNA probe (specific activity S.IO^cpnrVug). Washes 
were made in 2x SSC, 0.1 SDS, 55"C for 15 min. High stringency hybridization conditions were the same except that 
50% formamlde was used with 24 h hybridization. Washing was in 0.1 x SSC, 0.1 SDS, 55°C for 30 min. 

35 

A. 8. 5. Construction of Plasmids for In-Vltro Translation 

The 3 kb insert of phage lambda-13 was excised from the phage DNA by partial EcoRI digestion, electroeluted 
and digested by BamHI and Hindlll. To remove most of the untranslated sequences, the 1 .8 kb cDNA fragment obtained 
40 was then partially digested by Mae1 (Boehringer). The 1.4 kb Mae l -Mae l fragment, extending from the first to the 
third Mae l site In the cDNA insert sequence (Fig. 1) and containing the complete coding region was mixed with Smal 
cut dephosphorylated pT218 (Pharmacia), the extremities were filled in using Kleenow fragment of DNA Poll (Amer- 
sham) and ligated. Two plasmids were derived: pCOD20 (sense) and pC0D14 (antisense). 

^5 A. 8. 6. Translation and hormone binding assays 

pCOD20 and pCOD14 were linearized with Hind lll. Capped mRNA was generated using 5 ug of DNA, 5 uM rNTP, 
25 mM DTT, 100 U RNAsin (Promega). 50 U T7 Pol (Genofit) in 40 mM Tris pH 8. 8mM MgCIa, 2 mM spermidine, 50 
mM NaCI, in 1 00 ul at 37*C. Capping was performed by omitting GTP and adding CAP (m^ G (5') ppp (50 G) (Pharmacia) 

50 for the 15 first minutes of the reaction. Translation was performed using rabbit reticulocyte lysate (Amersham) under 
the suggested conditions using 40 ul of lystae for 2.5 ug of capped RNA. 

The thyroid hormone binding assays Included 5 ul of lysate In (0.25 M sucrose. 0.25 KCI, 20 mM Tris (pH 7.5), 1 
mM MgCl2, 2 mM EDTA, 5 mM DTT) with 1 mM i25| 74^ 125| 73 ori25| rT3 (specific activity: T4, rT3 1400 mCi/mg 
Amersham, T3 3000 mCi/mg NEN). After at least 2 h of incubation at O'C, free was separated from bound by filtration 

5S through millipore H AWP 02500 filters using 1 0 ml of ice cold buffer. For testosterone, retinol, retinoic acid 1 0 ul of lysate 
was added to 45 lambda of 20 mM Tris pH 7.3, 1 mM EDTA, 50 mM NaCI, 2 mM beta-mercaptoethanol and 5 mM 
testosterone, 400 mM retinol or 1 5 mM retinoic acid (81 Ci/mmol; 60 Ci/mmol; 46 Ci/mmol; Amersham). After an over- 
night incubation at 0°C free was separated from bound by Dextran coated charcoal (0.5% Norit A - 0.05% T70) and 
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centrifugation. All experiments were performed in duplicates and parallel experiments were performed with 100 fold 
excess corresponding cold hormone. 

B. DIFFERENTIAL EXPRESSION AND LIGAND REGULATION OF THE RETINOIC ACID RECEPTOR a AND B 
s GENES 

The recent cDNA cloning of several nuclear hormone receptors, Including the steroid and thyroid hormone recep- 
tors, has revealed that their overall structures were strikingly similar In particular, two highly conserved regions have 
been shown to correspond to the DNA- and hormone-binding domains (for review see Evans, 1988). 

^0 Analysis of a hepatitis B virus integration site in a human hepatocellular carcinoma led to the identification of a 

putative genomic exon highly homologous to the DNA-binding domain of other members of this nuclear receptor mul- 
tigene family (Dejean et al., 1986). Two different cDNAs homologous to this sequence have recently been cloned 
(Giguere et al., 1987; Petkovich et al., 1987; de The et al., (1987) and their translation products identified as retinoic 
acid receptors (designated RAR a and RAR p) (Giguere et al., 1987; Petkovich et al., 1987; Brand et al., 1988). The 

IS two receptors have almost Identical DNA- and hormone-binding domains but differ in their N-terminal part. Their re- 
spective genes map to different chromosomes, 1 7q21 . 1 for RAR a (Mattel et al., 1 988) and 3p24 for RAR p (Mattel et 
al., 1988), and their nucleotide sequences are only distantly related. Both genes are found in most species (Brand et 
al.. 1 988 and de The, unpublished results), suggesting an early gene duplication. Analysis of the RA-dependent gene 
transactivation also showed that the ED 50 of RAR a and p were significantly different (10-^ and lO'® M, respectively), 

20 indicating that RAR-p may mediate activation of transcription at RA concentrations 1 0-fold lower than those necessary 
for activation by RAR a (Brand et al.. 1988). 

The existence of two different retinoic acid receptors raises a number of questions as to the biological consequenc- 
es of the RAR gene publication. In particular, differences in the mechanisms of regulation or spatial expression patterns 
of the two receptors could account for distinct physiological roles. The tissue distribution of the transcripts for RAR a 

2S and p and their response to RA have been studied. The results show clear differences in the spatial patterns of ex- 
pression and Indicate that the p, but not the a, RAR gene is transcriptionally upregulated by RA in a protein synthesis- 
independent fashion. The discovery of differential expression of the RAR a and p genes, coupled with a selective 
regulation of RAR p gene expression by RA. may prove to be important components of retinoic acid physiology These 
findings strongly suggest that the two receptors are differentially involved In the various- biological effects of RA. The 

30 results obtained in the study are summarized below. 

The RAR a gene, which is transcribed as two mRNA species of 3.2 and 2.3 kb, Is overexpressed in the haemat- 
opoietic cell-lines and has an otherwise low level-expression in all the other human tissues examined. By contrast, the 
RAR p gene exhibits a much more varied expression pattern. Indeed, the two transcripts. 3 and 2.5 kb. show large 
variations in their levels of expression which range from undetectable (haematopoietic cell-tines) to relatively abundant 

35 (kidney, cerebral cortex, etc.). Run-on studies with the hepatoma cell-lines show that, at least in some tissues, these 
differences may be due to an increase In the transcription rate of the RAR p gene. These findings point to complex 
regulatory mechanisms of RAR gene expression that may confer the cells with various sensitivities to RA. 

The availability of cloned RAR cDNAs prompted an investigation of possible regulation of these receptor mRNAs 
by RA. Exposure of hepatoma cells to RA led to a rapid Increase in the level of RAR p transcripts, while the abundance 

40 of RAR a transcripts remain unaffected. The stimulation of expression of RAR p mRNAs was induced by physiological 
concentrations of RA in a dose-dependent manner. Such autoregulation is a general feature of hormonal systems and 
has been shown to take place at the mRNA and protein levels, in the case of the nuclear receptors for glucocorticoids 
(down-regulation, Okrent et al.. 1986) or vitamin D3 (up-regulation, McDonnell et al.. 1987). The RA-induced upregu- 
lation of the RAR p transcripts was observed in the pressence of protein synthesis inhibitors. In vitro nuclear transcript 

45 run-on assays show that the RA-lnduced increase in RAR p mRNAs levels is the consequence of an enhanced tran- 
scription. These findings demonstrate that the RAR p gene is transcriptionally unregulated by the RA and provide the 
first identification of a primary target gene for RA. The cloning of the promoter sequences of the RAR p gene should 
allow the identification of the upstream genomic elements implicated in RA responsiveness. The use of these sequenc- 
es will provide a useful tool to determine which one of the a and/or the p receptor Is involved in regulating p RAR gene 

50 expression. 

The haematopoietic cell-line HL60 has been widely used as a model for RA-induced differentiation (Strickland and 
Mahdavi, 1 978). The data from this invention suggest that in this system RAR a must be responsible for the RA-induced 
differentiated phenotype, since HL60 does not appear to have any RAR p mRNAs. Note in this respect that Davies et 
al. (1985) studying the RA-dependent transglutaminase expression in these cells have found an ED 50 of 5x1 O^M 
55 consistent with a RAR a-mediated transactivation. 

The upregulation of the P receptor gene by RA may have very important implications in developmental biology 
Morphogen gradients are frequently implicated In cell commitment (Slack, 1987). One example of this phenomenon is 
the polarization of the chick limb bud where RA. the suspected morphogen, forms a concentration gradient across the 



16 



EP 0 321 362 B1 



anterior-posterior axis of the developing bud (Thaller and Eichele, 1 987). However, the small magnitude of this gradient 
(2.5 fold) is puzzling and suggests the existence of amplification mechanisms (Robertson, 1987). Since transactivation 
of target genes is dependent upon both receptor and ligand concentrations, a small increase in RA may result in a 
disproportionately larger RAR p effect. The effect of this RA gradient could be potentiated by a corresponding gradient 
5 in RAR p receptors as a consequence of upregulation by RA itself. 

B.1. Tissue distribution of the a and 3 RAR mRNAs. 

To study the differential expression of the RAR a and p genes, Northern blot analysis was performed using 5 )ig 
10 (microgram) of poly(A) + RN A extracted from various human tissues and cell-lines. A RAR p clone previously identified 
(de The et al., 1 987) was used to isolate a partial cDNA clone for RAR a from a hepatoma cell-line cDNA library, and 
the two cDNA inserts were used as probes. More particularly poIy(A) -i- mRNA (5 ^g) from different human tissues and 
cell-lines was denatured by glyoxal, separated on a 1 .2% agarose gel, blotted onto nylon filters and hybridized to an 
a (Fig. 9, upper panel) then a p (Fig. 9. middle panel) RAR cDNA single-stranded probe (see materials and methods. 
'5 infra ). Exposure time was 36 h. The fitters were subsequently hybridized to a p actin probe (Fig. 9, lower panel) to 
ensure that equal amounts of RNA were present in the different lanes. The following abbreviations are used in Fig. 9. 
Sp. cord; spinal cord. C. cortex: cerebral cortex. K562 and HL60 are two haematopoietic cell-lines. PLC/PRF/5 is a 
hepatoma derived cell-line. 

Referring to Fig. 9, the spatial distribution patterns were clearly distinct between the two receptors. The RAR a 
20 probe hybridized to two transcripts of 3.2 and 2.3 kilobases (kb) with an approximately equal intensity. The two mRNAs 
were present at low levels in all tissues examined but were overexpressed in the haematopoietic cell-lines, K562 and 
HL60. 

When the same filters were hybridized with the RAR p probe, a much more variable transcription pattern was 
observed (Figure 9). Two mRNA species of 3 kb and 2.5 kb were visible in most tissues, except in the spinal cord and 

25 the liver (adult or fetal) where the smaller transcript was undetectable. Ma\or quantitative differences in the level of 
expression of the two transcripts were noted. The tissues examined could be classified into four groups with respect 
to expression of P receptor mRNAs: high (kidney, prostate, spinal cord, cerebral cortex, PLC/PRF/5 cells), average 
(liver, spleen, uterus, ovary), low (breast, testtis) and undetectable (K562 and HL60 cells). The use of a p probe that 
did not hybridize to a, allowed us to correct our previous description of the p RAR transcripts in these haematopoietic 

30 cell-lines (de The et al., 1987). The suppression of p receptor gene expression, associated with an overexpression of 
RAR a mRNAs seems to be a general feature of haematopoietic cell-lines, since similar results were obtained when 
we repeated the study using six other cell-lines (HEL, LAMA, U937. KGl. CCRF, Burkitt) (data not shown). 

8.2. RA-induced mRNA regulation . 

35 

To investigate whether retinoic acid modulates the expression of its own receptor, PLC/PRF/5 cells were grown in 
the presence of various concentrations of RA for different times, and RAR a and P mRNAs were analysed by Northern 
blot hybridization. More particularly, semi-confluent cells were grown for 6 hr in charcoal stripped medium and retinoic 
acid was then added to the medium at various concentrations {^0r^^ M to 10"® M) for 4 hr. Control cells were treated 

40 with ethanol (E). Northern -blotting was performed as described in connection with Figure 9. except that 30 of total 
RNA was used. Dose-response is shown in Fig. 10A. 

Another analysis was performed as in Fig. 10A, except that lO-^M RA was used for various times (0-12 h). Time- 
response is shown in Fig. 108. Exposure time was 12 hr for the p probe (Fig. 108, lower panel) and four days for the 
a probe (Fig. 108, upper panel). 

45 When the cells were treated with a high concentration of RA (lO^® M), a rapid increase in p receptor mRNAs was 

observed, and a dose-response analysts showed that this stimulatory effect was already evident at a RA concentration 
of 10-^ M (Fig. 10A. lower panel). From densitometry, the magnitude of the RA-induced upregulation was 10-fold. 

Since the PLC/PRF/5 cells constitutively overexpress the RAR p mRNAs (Fig. 9). the experiment was repeated 
using the HEPG2 hepatoma cell-line, which has a level of RAR p expression similar to that of normal adult liver (de 

so The et aL, 1987). In this case, there was a greater (50-fold) RA-induced stimulation of the levels of RAR p mRNAs 
(data not shown). Exposure of the PLC/PRF/5 cells to RA {^Q^^ M) during various periods indicated that the induction 
had a latency of one hour, was complete after four hours, and did not increase after an overnight treatment (Fig. 108, 
lower panel). After hybridizing the same filters with an RAR a probe, no variation was found in the level of the a receptor 
mRNAs (Fig. 10, upper panel), indicating that RA had no effect on the expression of the RAR a gene. 

55 

8.3. Effect of inhibitors . 

To investigate the mechanism of activation of RAR p gene by RA, experiments with PLC/PRF/5 cells were per- 
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formed in the presence or absence of various inhibitors of transcription or translation, or were treated with ethanol (E) 
as a control. 

More particularly. PLC/PRF/5 cells were exposed to charcoal stripped nnediunn for 6 hr; subsequently ethanol (E), 
HA (lO'^M) and/or inhibitors cyclohexinnide (CH) 10|ig/ml oractlnonnycin D (AC) (5 ^g/ml) were added for an additional 
5 4 hr. Northern -blotting was carried on using 30 )ig of total RNA. Figure 11 shows filters hybridized first to the RAR P 
probe (Fig. 11, right panel), then to the a probe (Fig. 11 , left panel), and finally toa pactin probe (Fig. 11, lower panel). 
Exposure times were the same as for the experiments in Figure 10. 

The RNA synthesis inhibitor actinomycin D (AC) abolished the RA-induced increase in the levels of RAR p tran- 
scripts (compare the RA+AC lane to the RA and E+AC lanes), while the protein synthesis inhibitor cycloheximide (CH) 
^0 did not (compare lanes RA+CH to CH). Neither RA. AC, nor CH significantly affected the levels of p actin mRNA (Fig. 
1 1 , lower panel). These findings suggest that RA-induction of the p receptor gene results from a direct transcriptional 
effect. When the same filters were rehybridized to the RAR a probe (Fig. 11 , left panel) the presence or absence of 
RA had no effect on the levels of RAR a mRNAs confirming that the RAR a gene is not regulated by RA. 

'5 B.4. Nuclear transcript elongation analysis . 

Nuclear run-on experiments were carried out to determine if the enhanced expression of the RAR p gene was due 
to increased transcription. PRF/PLC/5 cells were grown in the presence of ethanol (E) or retinoic acid (RA), their nuclei 
were isolated, and transcription was performed in the presence of (32p)UTP. The labelled RNAs were hybridized to 

20 filters containing single-stranded RAR p cDNA Inserts in the appropriate orientation (S (sense) 10 ^g and 1 ^ig), or in 
the reverse orientation (AS (antisense) 20 iig). A p actin control was also included. Exposure time was 12 hours. The 
results are shown in Figure 12. 

The specific hybridization, which reflects the transcription rate, is clearly induced by RA. In addition, the magnitude 
of the increase in RAR p mRNAs is comparable when assessed by run-on assays (5 to 7 fold) or Northern analysis (8 

25 fold to 10 fold). These experiments establish that the RAR p gene is transcriptionally upregulated by RA. 

Nuclear transcript elongation assays were also used to investigate whether the higher steady-state levels of RAR 
P mRNAs obsen/ed in the hepatoma cells PRF/PLC/5 compared to HEPG 2 (de The et al.. 1987), were related to 
differences in transcription rates. Transcript elongation assays were performed with PRF/PLC/5 and HEPG2 cells as 
described below in material and methods, in the absence of added RA. The filters contained, respectively, 10 ^g and 

30 20 ng of sense (S) and antisense (AS) RAR p cDNA inserts. Exposure time was 24 hours. The results are shown in 
Figure 13. 

A much greater specific hybridization signal, relative to the p actin control, was observed in PRF/PLC/5 cells com- 
pared to the HEPG 2 cells (Fig. 13), indicating that their transcription rates are different. This result suggests that at 
least some of the variations in RAR p expression in the human tissues and cell-lines (Fig. 9) might be due, in a similar 
55 manner, to differences in the transcription rates of the RAR p gene. 

B.5. Stability of RAR mRNAs 

The level of RAR p mRNAs was slightly higher after cycloheximidine treatment (compare the E lane to the CH 
40 lane in Fig. 11 , right panel). In the presence of RA, CH treatment caused approximately a 50-fold increase in the level 
of RAR p gene expression (compare lane E to RA+CH). Such superinduction by cycloheximide has been described 
for several genes and associated with either transcriptional or post-transcriptional mechanisms (Greenberg et al., 
1986). 

To determine whether RNA stabilization was involved in the induction by CH, PLC/PRF/5 cells were first stimulated 
45 for 3 hours by RA (10"®M) in the presence of CH (lOjig/ml) and extensively masked with culture medium. Transcription 
was then blocked by addition of actinomycin D (5 ^g/ml) and the level of RAR mRNAs was monitored for the next 5 
hours in the presence or absence of CH. Northern-blotting was done using 30 ^g of total RNA. The results are shown 
in Figure 1 4. The fitters were hybridized first to the RAR p probe (Fig. 1 4, right panel), then to the a probe (Fig. 1 4, left 
panel), and lastly to a p actin probe (Fig. 14, lower panel). Exposure times were as in Figure 10. 
50 Quantification of the RAR p mRNAs levels indicated that CH indeed stabilized the p transcripts, as CH increased 

their half-life from approximately 50 to 80 min (Fig. 1 4, right panel). The combined effect of increased transcription and 
reduced degradation may account for the synergistic effect of RA and CH on p mRNAs levels. In the case of RAR a, 
cycloheximide treatment caused only a slight increase in mRNAs levels and no superinduction by RA was observed 
(Fig. 11 , left panel). In addition, the a receptor mRNAs, which have a half life of at least 5 hours, are more stable than 
55 the RAR p transcripts (Fig. 14, left panel). A pentanucleotide, ATTTA, in NT rich 3' non-coding regions seems to 
mediate mRNA degradation (Shaw and Kamen, 1 986). The 3.2 kb RAR a transcript has an A/T poor 3' end (38%) and 
contains two such motifs (Giguere et al.. 1987; Petkovich et al., 1987), whereas the 3 kb RAR p mRNA has an A/T 
rich 3' end (68%) and four copies of ATTTA (de The et al., 1987). These findings are consistent with the differences in 
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RAR aand |3 mRNAs stability that have been observed. 

B 6. MATERIAL AND METHODS 

S.6.1 . Biological samples and cell-lines . 

Human tissue samples were obtained from early autopsies and kept at -80° C prior to extraction. The HEPG 2 and 
PLC/PRF/5 hepatoma cell-lines were grown in Dulbecco's modified Eagle's medium with 10% fetal calf serum, 
glutamine, and antibiotics, in 5% CO2. Semiconfluent celts were treated with RA after a 6 h wash-out in charcoal 
stripped medium. All-trans-retinoic acid was obtained from Sigma. Cycloheximide and actinomycin D (both from Sigma) 
were used at concentrations of 10 and 5 ^g/ml (micrograms/milliliter), respectively. 

B.6.2. RNA preparation . 

The RNA was prepared by the hot phenol procedure (Maniatis et al., 1 982). Poly(A)+ mRN A was prepared by oligo 
(dT)-cellulose chromatography. For Northern-blot analysis, total RNA (30 |ig) or poly(A)+ mRNA (5 (ig) was denatured 
by glyoxal and fractionated on a 1.2% agarose gel (Maniatis et al., 1982). The nucleic acid was transferred to nylon 
membranes (Amersham) by blotting and attached by UV exposure plus baking. 

B.6.3. Recombinant clones . 

The p receptor probe was a 600 bp fragment of the cDNA previously described (de The et al., 1987) extending 
from the 5' end to the Xho t site, corresponding to 5' untranslated region and the A/B domain. The a receptor probe 
was a short cDNA insert that was isolated from a PLC/PRF/5 human hepatoma cell-line cDNA library generated as 
described (Watson and Jackson, 1 986). This library was hybridized with an RAR p-derived probe (nucleotides 550 to 
760) corresponding to the conserved DNA-binding domain of RAR p. A weakly hybridizing plaque was purified, sub- 
cloned into M13mp18: and sequenced by the dideoxy procedure. This clone was found to be identical to RAR a and 
extended from nucleotides 358 to 587, corresponding to the C and D domains (Giguere et al., 1 987). Since this cDNA 
insert contains some regions homologous to the RAR p cDNA, cross-hybridization has "been occasionally observed, 
particularly in cell-lines that overexpress RAR p mRNAs. 

B.6.4. Hybridization procedure . 

The two cDNA inserts were subcloned into Ml 3 and used to generate high specific activity (greater than 10^ c. p. 
m./|ig) single^stranded probes by elongation of a sequencing primer with 32p labelled dTTP (3000 Ci/mmol) and un- 
labelled nucleotides by Klenow polymerase. The resulting double-stranded DNA was digested using a unique site in 
the vector, f ractioned on a urea/acrylamide sequencing gel, and the labelled single-stranded insert etectroeluted. These 
probes (5x10^ cpm/ml) were hybridized to the filters in 7% (w/v) sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 6.5, 
1 mM ethylenediaminetetraacetate (EDTA), and 1 mg/ml bovine serum albumin (BSA) at 68°C overnight. The filters 
were washed in 1% SDS, 50 mM NaCI, 1 mM EDTA at 68'C for 10 min and autoradiographed at -70'C using Kodak 
XAR films and intensifying screens. A mouse p actin probe was used to rehybridize the filters and check that all lanes 
contained equal amounts of RNA. 

B.6.5. Nuclear run-on experiments . 

Nuclear transcript elongation assays were performed as described (Mezger et al., 1 987). PLC/PRF/5 or HEPG 2 
cells (10^) were challenged with ethanol or with 10'^ M RA for 6 hours in charcoal-stripped medium. After isolation of 
the nuclei, transcription was performed in a final volume of 100 ^1 (microliters) with 150 uCi (microcuries) of (a^sp) 
DTP (3000 Ci/mmol). Typical incorporation ranged between 2 and 6x10^ cpm. The labelled RNA was hybridized to 
nylon filters (Amersham) containing 10 |ig and 1 )ig of a 3' end RAR p cDNA insert (position 2495 to 2992, de The et 
al., 1987) cloned in Ml 3; 20 |j.g of the same insert in the reverse orientation were included as a negative control. A 
plasmid containing a mouse p actin insert (4 \ig) provided a positive and quantitative hybridization control. Hybridization 
was performed with a probe concentration of 2-6x1 0^ cpm/ml for 48 hours. 

The relative intensity of hybridization signals In Northern -blotting and run-on experiments was estimated using a 
Hoefer scanning densitometer and the appropriate computer program. 

Our results showing a direct autoregulation of the transcription of the RAR-p gene implies that the retinoic acid 
receptor p binds to its own gene promoter sequences. To identity those sequences, several 5' coterminal RAR-p cDNA 
clones were derived from the PRF/PLC/5 library previously described. Nucleotide sequence analysis showed that these 
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clones extended our previous X 13 RAR-p clone by 72 bp, which are shown in Figure 15. Thus, this invention also 
provides the 72 bp nucleotide sequence shown in Fig. 15, as well as a cloned DNA sequence encoding a polypeptide 
of hap gene, wherein the sequence has the formula 

CCCATGC 

GAGCTGTTTGAAGGACTGGGATGCCGXGAACGCGAGCGATCCGAGCAGGGTTTGTCTGCCCACCGT 
ATGTTTGACTGTATGGATGTTCTGTCAGTGAGTCCTGGGCAAATCCTGATTCTACACTGCGAGTCC 
GTCTTCCTGCATGCTCCAGGAGAAAGCTCTCAAAGCATGCTTCAGTGGATTGACCCAAACCGAATC 

GCAGCATCGGCACACTGCtCAATCAATTGAAACACAGAGCACCAGCTCTGAGGAACTCGTCCCAAG 

CCCCCCATCTCCACTTCCTCCCCCTCGAGTGATCAAACCCTGCTTCGTCTGCCAGGACAAATCATC 

AGGGTACCACTATGGGGTCAGCGCCTGTGAGGGATGAACGGCTTTTTCCGCAGAAGTATTCAGAAG 

AATATGATTTACACTTGTCACCGAGATAAGAACTGTGTTATTAATAAAGTCACCAGGAATCGATGC 

CAATACTGTCGACTCCAGAAGTGCTTTGAAGTGGGAATGTCCAAAGAATCTGTCAGCAATGACAGG 

AACAAGAAAAAGAAGGAGACTTCGAAGCAAGAATGCACAGAGAGCTATGAAATGACAGCTGAGTTG 

GACGATCTCACAGAGAAGATCCGAAAAGCTCACCAGGAAACTTTCCCTTCACTCTCGCAGCTGGGT 

AAATACACCACGAATTCCAGTGCTGACCATCGAGTCCGACTGGACCTGGGCCTCTGGGACAAATTC 

AGTGAACTGGCCACCAAGTGCATTATTAAGATCGTGGAGTrTGCTAAACGTCTGCCTGGTTTCACT 

GGCTTGACCATCGCAGACCAAATTACCCTGCTGAAGGCCGCCTCCCTGGACATCCTGATTCTTAGA. 

ATTTGCACCAGGTATACCCCAGAACAAGACACCATGACTTTCTCAGACGGCCTTACCCTAXATCGA 

ACTCAGATGCACAATGCTGGATTTGGTCCTCTGACTGACCTTGTGTTCACCT'TTGCCAACCAGCTC 

CTGCCTTTGGAAATGGATGACACAGAAACAGGCCTTCTCAGTGCCATCTGCTTAATCTGTGGAGAC 

CGCCAGGACCTTGAGGAACCGACAAAAGTAGATAAGCTACAAGAACCATTGCTGGAAGCACTAAAA 

ATTTATATCAGAAAAAGACGACCCAGCAACCCTCACATGTTTCCAAAGATCTTAATGAAAATCACA 

GATCTCCCTAGCATCAGTGCTAAAGGTGCAGAGCGTGTAATTACCTTGAAAATGGAAATTCCTGGA 

TCAATGCCACCTCTCATTCAAGAAATGATGGAGAATTCTGAAGGACATGAACCCTTGACCCCAAGT 

TCAAGTGGGAACACAGCAGAGCACAGTCCTAGCATCTCACCCAGCTCAGTGGAAAACAG7GGGGTC 

AGTCAGTC ACCACTCGTGCAAT AA , 

and serotypic variants thereof, wherein said DNA is in a purified fornn. 

This 72 bp sequence was used as a probe to screen a human genomic library. Six overlapping clones were derived, 
and a 6 kb HindlH - Bam HI insert containing the probe was subcloned into PT2 16 at the same sites to give rise to the 
plasmid pPROHAP. Since this genomic DNA insert is limited by the Bam HI site present in the original X 13 clone and 
contains the additional 72 bp of the 5' end of the mRNA, it also contains the promoter region and all the elements 
necessary for the RAR-p gene expression and regulation. Preliminary SI analysis using the plasmid pPROHAP end 
labelled at the Bam HI site suggest that the cloned RAR-p cDNA are full-size and that the cap site is indeed located in 
the 90 bp BamHI-EcpRI fragment. 
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A compiete restriction map of the Hindill-BamHI genomic DNA insert is shown in Figure 16. 

Plasmid pPROHAP was transfected into the E, coli strain DHSaF' (from B.R.L). A viable culture of E. coli strain 
DH5aF' transformed with plasmid pPROHAP was deposited on November 29, 1988, with the National Collection of 
Cultures of Microorganisms or Collection Nationals de Cultures de Micro-organisms (C.N.C.M.) of Institut Pasteur, 
5 Paris, France, under Culture Collection Accession No. C.N. CM. 1-821 . 

This DNA insert, which is characterized by its restriction map and partial nucleotide sequence (or some of its 
fragment), provides a tool to assess RAR-p function, because it must contain a RAR responsive enhancer Several 
constructs In which this promoter region controls the expression of indicator genes, such as the p-galactosidase or the 
chloramphenicol acetyl transferase (CAT), have been designed. Transient or stable expression, in eucaryotic cells, of 
^0 these constructs, together with an expression vector of RAR-p, provides a useful model system to directly assess 
stimulation of RAR-P by a retinoid. 

Thus, this invention also provides a recombinant DNA-molecute comprising a DNA sequence of coding for a retinoic 
acid receptor, said DNA sequence coding on expression in a unicellular host for a polypeptide displaying the retinoic 
acid and DNA binding properties of RAR-p and being operatively linked to an expression control sequence in said DNA 
IS molecule. 

It should be apparent that the foregoing techniques as well as other techniques known in the field of medicinal 
chemistry can be employed to assay for agonists and antagonists of ligand binding to RAR-p and binding of the RAR- 
p protein to DNA. Specifically this invention makes it possible to assay for a substance that enhances the interaction 
of the ligand, the RAR-p protein, and the DNA, or combinations of these materials to elicit an observable or measurable 

20 response. The substance can be an endogenous physiological substance or it can be a natural or synthetic drug. 

This invention also makes it possible to assay for an antagonist that inhibits the effect of an agonist, but has no 
biological activity of its own in the RAR-P effector system. Thus, for example, the invention can be employed to assay 
for a natural or synthetic substance that competes for the same receptor site on the RAR-p protein or the DNA that 
the agonist occupies, or the invention can be employed to assay for a substance that can act on an allosteric site, 

25 which may result in allosteric inhibition. 

It will be understood that this invention is not limited to assaying for substances that interact only in a particular 
way, but rather the invention Is applicable to assaying for natural or synthetic substances, which can act on one or 
more of the receptor or recognition sites, including agonist binding sites, competitive antagonist binding sites (accessory 
sites), and non-competitive antagonist or regulatory binding sites (allosteric sites). 

30 A convenient procedure for carrying out the method of the invention involves assaying a system for stimulation of 

RAR-p by a retinoid. For example, as a retinoid binds to the receptor, the receptor-ligand complex wilt bind to the 
responsive promotor sequences and will activate transcription. For example, transcription of the p-galactosidase or 
CAT genes can be determined. The method of this invention makes it possible to screen p-receptor binding retinoids. 
In addition, this invention makes it possible to carry out blood tests for RAR-p activity in patients. 

35 In summary, a hepatitis B virus (HBV) integration in a 1 47 bp cellular DNA fragment homologous to steroid receptors 

and c-erb A /thyroid hormone receptor genes previously isolated from a human hepatocellular carcinoma (HCC) was 
used as a probe to clone the corresponding complementary DNA from a human liver cDNA library. The nucleotide 
sequence analysis revealed that the overall structure of the cellular gene, named hap, is similar to that of DNA-binding 
hormone receptors. That is, it displays two highly conserved regions identified as the putative DNA-binding and hor- 

40 mone-binding domains of the c-erb A /steroid receptors. Six out of seven hepatoma and hepatoma-de rived cell-lines 
express a 2.5 kb hag mRNA species which is undetectable In normal adult and fetal livers but present in all non-hepatic 
tissues analyzed. Low stringency hybridization experiments revealed the existence of hag related genes in the human 
genome. Taken together, the data suggest that the hap product may be a member of a new family of ligand-responsive 
regulatory proteins whose inappropriate expression in liver seems to correlate with the hepatocellular transformed state. 

45 Because the known receptors control the expression of target genes that are crucial for cellular growth and differ- 

entiation, an altered receptor could participate in the cell transformation. In that sense, avian v-erbA oncogene, which 
does not by itself induce neoplasms in animals, potentiates the erythroblast transformant effect of v-erbB and other 
oncogenes of the src family (Kahn etal., 1986). It has been shown that the v-erbA protein has lost its hormone-binding 
potential (Sap et a!., 1 986), presumably as a result of one or several mutations it has accumulated in its putative ligand- 

50 binding domain. It has been also suggested (Edwards et al., 1979) that the growth of human breast tumors are corre- 
lated to the presence of significant levels of ER. This invention may provide a novel example in which a DNA-binding 
protein would again relate to the oncogenic transformation by interfering with the transcriptional regulation of target 
genes. DNA-transfection assays using the native hag cDNA as well as 'altered' hap genes derived from various HCC 
can provide important information concerning any transforming capacity. 

55 Following is a more detailed identification of the literature citations appearing above in parenthesis: 

Beasley, R.P., and Hwang, L.Y (1984). Epidemiology of Hepatocellular Carcinoma In Viral Hepatitis and Liver 
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Claims 
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A polypeptide comprising an amino acid sequence which is named hap protein and which consists of the following 
amino acid sequence: 



50 
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Met Phe Asp Cys Met Asp Val 
lie Leu Asp Phe Tyr Thr Ala 
Gin Glu Lys Ala Leu Lys Ala 
Thr Glu Trp Gin His Arg His 
Gin Ser Thr Ser Ser Glu Glu 
Pro Leu Pro Pro Pro Arg Val 
Gin Asp Lys Ser Ser Gly Tyr 
Glu Gly Cys Lys Gly Phe Phe 
Met lie Tyr Thr Cys His Arg 
Lys Val Thr Arg Asn Arg Cys 
Cys Phe Glu Val Gly Met Ser 
Arg Asn Lys Lys Lys Lys Glu 
Glu Ser Tyr Glu Met Thr Ala 
Lys lie Arg Lys Ala His Gin 
Gin Leu Gly Lys Tyr Thr Thr 
Val Arg Leu Asp Leu Gly Leu 
Ala Thr Lys Cys lie lie Lys 
Leu Pro Gly Phe Thr Gly Leu 
Leu Leu Lys Ala Ala Cys Leu 
Cys Thr Arg Tyr Thr Pro Glu 
Asp Gly Leu Thr Leu Asn Arg 
Phe Gly Pro Leu Thr Asp Leu 
Leu Leu Pro Leu Glu Met Asp 
Ser Ala lie Cys Leu lie Cys 
Glu Pro Thr Lys Val Asp Lys 
Ala Leu Lys lie Tyr lie Arg 
His Met Phe Pro Lys He Leu 
Ser He Ser Ala Lys Gly Ala 
Met Glu He Pro Gly Ser Met 
Met Glu Asn Ser Glu Gly His 



Leu 


Ser 


Val 


Ser 


Pro 


Gly 


Gin 


Ser 


Pro 


Ser 


Ser 


Cys 


Met 


Leu 


Cys 


Phe 


Ser 


Gly 


Leu 


Thr 


Gin 


Thr 


Ala 


Gin 


Ser 


He 


Glu 


Thr 


Leu 


Val 


Pro 


Ser 


Pro 


Pro 


Ser 


Tyr 


Lys 


Pro 


Cys 


Phe 


Val 


Cys 


His 


Tyr Gly 


Val 


Ser 


Ala 


Cys 


Arg 


Arg 


Ser 


He 


Gin 


Lys 


Asn 


Asp 


Lys 


Asn 


Cys 


Val 


He 


Asn 


Gin 


Tyr 


Cys 


Arg 


Leu 


Gin 


Lys 


Lys 


Glu 


Ser 


Val 


Arg 


Asn 


Asp 


Thr 


Ser 


Lys 


Gin 


Glu 


Cys 


Thr 


Glu 


Leu 


Asp 


Asp 


Leu 


Thr 


Glu 


Glu 


Thr 


Phe 


Pro 


Ser 


Leu 


Cys 


Asn 


Ser 


Ser 


Ala 


Asp 


His 


Arg 


Trp 


Asp 


Lys 


Phe 


Ser 


Glu 


Leu 


He 


Val 


Glu 


Phe 


Ala 


Lys 


Arg 


Thr 


He 


Ala 


Asp 


Gin 


He 


Thr 


Asp 


He 


Leu 


He 


Leu 


Arg 


He 


Gin 


Asp 


Thr 


Met 


Thr 


Phe 


Ser 


Thr 


Gin 


Met 


His 


Asn 


Ala 


Gly 


Val 


Phe 


Thr 


Phe 


Ala 


Asn 


Gin 


Asp 


Thr 


Glu 


Thr Gly 


Leu 


Leu 


Gly 


Asp 


Arg 


Gin 


Asp 


Leu 


Glu 


Leu 


Gin 


Glu 


Pro 


Leu 


Leu 


Glu 


Lys 


Arg Arg 


Pro 


Ser 


Lys 


Pro 


Met 


Lys 


He 


Thr 


Asp 


Leu 


Arg 


Glu 


Arg 


Val 


He 


Thr 


Leu 


Lys 


Pro 


Pro 


Leu 


He 


Gin 


Glu 


Met 


Glu 


Pro 


Leu 


Thr 


Pro 


Ser 


Ser 



Ser Gly Asn Thr Ala Glu His Ser Pro Ser He Ser Pro Ser 
Ser Val Glu Asn Ser Gly Val Ser Gin Ser Pro Leu Val Gin 

or serotypic variants of this sequence, 

provided that this polypeptide displays the retinoic acid and DNA binding properties of RAR-beta. 

Polypeptide as clainned in claim 1, which is free from human blood derived proteins, virus, viral protein, human 
tissues and human tissue components. 
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A polypeptide selected from the group consisting in the polypeptides (a) to (g) : 



(a) 




























Gin 


His 


Arg 


His 


Thr 


Ala 


Gin 


Ser 


He 


GIU 


inr 


Gin 


Ser 


1 nr 


Ser 


Ser 


Glu 


Glu 


Leu 


Val 


Pro 


Ser 


Pro 


Pro 


Ser 


Pro 


Leu 


Pro 


Pro 


Pro 


Arg 


Val 


Tyr 


Lys 


Pro 


Cys 


Phe 


Val 


Cys 


Gin 


Asp 


Lys 


Ser 


Ser 


Gly 


Tyr 


His 


Tyr 


Gly Val 


Ser 


Ala 


Cys 


CjIU 


^ 1 IF 

CaXy 


Cys 


Lys 


Gly 


Phe 


Phe 


Arg 


Arg 


Ser 


He 


Gin 


Lys 


X 4-1 M 

Asn 


Met 


T 1 

lie 


Tyr 


Thr 


Cys 


His 


Arg 


Asp 


Lys 


Asn 


Cys 


Val 


He 


Asn 


Lys 


Val 


Thr 


Arg 


Asn 


Arg 


Cys 


Gin 


Tyr 


Cys 


Arg 


Leu 


Gin 


Lys 


Cys 


Phe 


Glu 


Val 


Gly 


Met 


Ser 


Lys 


Glu 


Ser 


Val 


Arg 


Asn 


Asp 


Arg 


Asn 


Lys 


Lys 


Lys 


Lys 


Glu 


Thr 


Ser 


Lys 


Gin 


Glu 


Cys 


Thr 


Glu 


Ser 


Tyr 


Glu 


Met 


Thr 


Ala 


Glu 


Leu 


Asp 


Asp 


Leu 


Thr 


Glu 


Lys 


He 


Arg 


Lys 


Ala 


His 


Gin 


Glu 


Thr 


Phe 


Pro 


Ser 


Leu 


Cys 








(b) 




























Val 


Arg 


Asn 


Asp 


Arg 


Asn 


Lys 


Lys 


Lys 


Lys 


Glu 


Thr 


Ser 


Lys 


Gin 


Glu 


Cys 


(peptide 1) 


• 

















(c) 

Asn Asp Arg Asn Lys Lys Lys Lys Glu Thr Cys (peptide 
2); 

(d) 

Cys Gly Val Ser Gin Ser Pro Leu Val Gin (peptide 3) ; 
(e) 

Ala Glu Leu Asp Asp Leu Thr Glu Lys He Arg 
(f) 

Met Phe Asp Cys Met Asp Val Leu Ser Val Ser Pro Gly Gin 
He Leu Asp Phe Tyr Thr Ala Ser Pro Ser Ser Cys Met Leu 
Gin Glu Lys Ala Leu Lys Ala Cys Phe Ser Gly Leu Thr Gin 
Thr Glu Trp Gin His Arg His Thr Ala Gin Ser 

(g) 

His Glu Pro Leu Thr Pro Ser Ser Ser Gly Asn Thr Ala Glu 
His Ser Pro Ser He Ser Pro Ser Ser Val Glu Asn Ser Gly 
Val Ser Gin Ser Pro Leu Val Gin 

or any serolypic variant of one of the said polypeptides (a) to (g), the polypeptide or the serolypic variant having 
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binding capacity with retinoic acid. 

A cloned DMA sequence comprising a sequence encoding for a polypeptide as claimed in claims 1 and 2, the said 
cloned sequence comprising the following formula (a1 ) ; 

ATG TTT GAC TGT ATG GAT GTT CTG TCA GTG AGT CCT GGG CAA 
ATC CTG GAT TTC TAG ACT GCG AGT CCG TCT TCC TGC ATG CTC 
CAG GAG AAA GOT CTC AAA GCA TGC TTC AGT GGA TTG ACC CAA 
ACC GAA TGG CAG CAT CGG CAC ACT GCT CAA TCA ATT GAA ACA 
CAG AGC ACC AGC TCT GAG GAA CTC GTC CCA AGC CCC CCA TCT 
CCA CTT CCT CCC CCT CGA ,GTG TAC AAA CCC TGC TTC GTC TGC 
CAG GAC AAA TCA TCA GGG TAC CAC TAT GGG GTC AGC GCC TGT 
GAG GGA TGT AAG GGC TTT TTC CGC AGA AGT ATT CAG AAG AAT 
ATG ATT TAC ACT TGT CAC CGA GAT AAG AAC TGT GTT ATT AAT 
AAA GTC ACC AGG AAT CGA TGC CAA TAC TGT CGA CTC CAG AAG 
TGC TTT GAA GTG GGA ATG TCC AAA GAA TCT GTC AGG AAT GAC 
AGG AAC AAG AAA AAG AAG GAG ACT TCG AAG CAA GAA TGC ACA 
GAG AGC TAT GAA ATG ACA GCT GAG TTG GAC GAT CTC ACA GAG 
AAG ATC CGA AAA GCT CAC CAG GAA ACT TTC CCT TCA CTC TGC 
CAG CTG GGT AAA TAC ACC ACG AAT TCC AGT GCT GAC CAT CGA 
GTC CGA CTG GAC CTG GGC CTC TGG GAC AAA TTC AGT GAA CTG 
GCC ACC AAG TGC ATT ATT AAG ATC GTG GAG TTT GCT AAA CGT 

CTG CCT GGT TTC ACT GGC TTG ACC ATC GCA GAC CAA ATT ACC 
CTG CTG AAG GCC GCC TGC CTG GAC ATC CTG ATT CTT AGA ATT 
TGC ACC AGG TAT ACC CCA GAA CAA GAC ACC ATG ACT TTC TCA 
GAC GGC CTT ACC CTA AAT CGA ACT CAG ATG CAC AAT GCT GGA 
TTT GGT CCT CTG ACT GAC CTT GTG TTC ACC TTT GCC AAC CAG 
CTC CTG CCT TTG GAA ATG GAT GAC ACA GAA ACA GGC CTT CTC 
AGT GCC ATC TGC TTA ATC TGT GGA GAC CGC CAG GAC CTT GAG 
GAA CCG ACA AAA GTA GAT AAG CTA CAA GAA CCA TTG CTG GAA 
GCA CTA AAA ATT TAT ATC AGA AAA AGA CGA CCC AGC AAG CCT 
CAC ATG TTT CCA AAG ATC TTA ATG AAA ATC ACA GAT CTC CGT 
AGC ATC AGT GCT AAA GGT GCA GAG CGT GTA ATT ACC TTG AAA 
ATG GAA ATT CCT GGA TCA ATG CCA CCT CTC ATT CAA GAA ATG 
ATG GAG AAT TCT GAA GGA CAT GAA CCC TTG ACC CCA AGT TCA 
AGT GGG AAC ACA GCA GAG CAC AGT CCT AGC ATC TCA CCC AGC 
TCA GTG GAA AAC AGT GGG GTC AGT CAG TCA CCA CTC GTG CAA 
TAA, 



26 



EP 0 321 362 B1 

or degeneracy variants of this formula encoding the said polypeptide. 

5. DNA sequence as claimed in claim 4. which is free of human serum protein components, such as human tissue 
or serum proteins. 

6. DNA sequence having any of the following formulae (b1) to (g1) corresponding respectively to the aminoacid 
sequences (b) to (g) of the claim 3 : 

(bi) 

GTC AGG AAT GAC AGG AAC AAG AAA AAG AAG GAG ACT TCG AAG 
CAA GAA TGC; 

(Cl) 

AAT GAC AGG AAC AAG AAA AAG AAG GAG ACT; 
(dl) 

GGG GTC ACT CAG TCA CCA CTC GTG CAA; 
(el) 



GCT 


GAG 


TTG 


GAC 


CAT 


CTC 


ACA 


GAG 


AAG 


ATC 


CGA; 








(fl) 




























ATG 


TTT 


GAC 


TGT 


ATG 


GAT 


GTT 


CTG 


TCA 


GTG 


AGT 


CCT 


GGG 


CAA 


ATC 


CTC 


GAT 


TTC 


TAC 


ACT 


GCG 


AGT 


CCG 


TCT 


TCC 


TGC 


ATG 


CTC 


CAG 


GAG 


AAA 


GCT 


CTC 


AAA 


GCA 


TGC 


TTC 


AGT 


GGA 


TTG 


ACC 


CAA 


ACC 


GAA 


TGG 


CAG 


CAT 


CGG 


CAC 


ACT 


GCT 


CAA 


TCA; 








(gi) 




























CAT 


GAA 


CCC 


TTG 


ACC 


CCA 


AGT 


TCA 


AGT 


GGG 


AAC 


ACA 


GCA 


GAG 


CAC 


ACT 


CCT 


AGC 


ATC 


TCA 


CCC 


AGC 


TCA 


GTG 


GAA 


AAC 


AGT 


GGG 


GTC 


ACT 


CAG 


TCA 


CCA 


CTC 


GTG 


CAA; 













or degeneracy variants of these formulae. 



7. A DNA probe consisting essentially of a radio-nucleotide bonded to the DNA sequenceof any of claims 4 or 5. 

8. A hybrid duplex molecule consisting essentially of the DNA sequence of claims 4 or 5, hydrogen bonded to a 
nucleotide sequence of complementary base sequence. 

9. Hybrid duplex molecule as claimed in claim 8, wherein said nucleotide sequence is either a DNA sequence, or a 
RNA sequence. 

10. A process for selecting, from a group of nucleotide sequences, a nucleotide sequence, e.g. a DNA sequence or 
a RNA sequence, said nucleotide sequence being preferably labelled, e.g. by a radionucleotide, said nucleotide 
sequence coding for thyroid or steroid hormone receptor, or for RAR-p or a portion thereof, said process comprising 
the step of determining which of said nucleotide sequences hybridizes to a DNA sequence as claimed In claims 
4 to 6. 
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11. Prcxess as claimed in claim 10, wherein said nucleotide sequence Is selected by Southern blot technique, under 
high stringency conditions performed as follows: 24 h prehybridization in 50% formamide, 5x Denhardt, 5x SSC, 
300 up/ml denatured salmon sperm DNA, at 40''C; 48 h hybridization with 35% formamide, 5x Denhardt, 5x SSC, 
10% Dextran sulfate, 2.10^ cpm/ml denatured labelled DNA probe (specific activity 5.10® cpm/ug); washes in 

5 0.!x SSC. 0.1 SDS, 55^*0 for 30 minutes. 

1 2. Process as claimed in claim 1 0, wherein said nucleotide sequence is selected by Northern blot technique according 
to Maniatts, T, Fritch, E., and Sambrook, J. (1982), Molecular cloning: a laboratory manual (Cold Spring Harbor, 
New- York: Cold Spring Harbor Laboratory). 

10 

13. An _E. coli bacterial culture in a purified form, wherein the culture comprises E. coli cells containing DNA, wherein 
a portion of said DNA comprises the DNA sequence as claimed in claim 4. 

14. A method for assaying a fluid for the presence of an agonist or antagonist to thyroid or steroid hormone receptor, 
^5 wherein the method comprises: 

(A) providing an aqueous solution containing a known concentration of the proteinaceous receptor as claimed 
in claim 1; 

(B) incubating the receptor with the fluid suspected of containing the agonist or antagonist under conditions 
20 sufficient to bind the receptor to the agonist or antagonist; and 

(C) determining whether there is change in concentration of the proteinaceous receptor in the aqueous solution. 

1 5. A method for assaying a fluid for the presence of an agonist or antagonist to retlnoic acid receptor RAR-p, wherein 
the method comprises: 

25 

(A) providing an aqueous solution containing a known concentration of the proteinaceous receptor as claimed 
in claim 1; 

(B) incubating the receptor with the fluid suspected of containing the agonist or antagonist under conditions 
sufficient to bind the receptor to the agonist or antagonist; and 

30 (C) determining whether there is change in concentration of the proteinaceous receptor in the aqueous solution. 

16. Method as claimed in claim 15, wherein the receptor and the agonist or antagonist form a complex. 

1 7. Method as claimed in claim 1 5, wherein a cross-linking agent is present in an amount sufficient to inhibit dissociation 
35 of the receptor and the agonist or antagonist. 

18. A recombinant DNA molecule comprising a DNA sequence of coding for a retinoic acid receptor, said DNA se- 
quence coding on expression in a unicellular host for a polypeptide displaying the retinoic acid and DNA binding 
properties of RAR-p and being operatively linked to an expression control sequence in said DNA molecule. 

40 

19. The recombinant DNA molecule of claim 18 wherein the DNA sequence is any one of claims 4 to 6. 

20. Plasmid pPROHAP, deposited at the CNCM on November 29, 1988, under n' t-821. 

^ 21. Bacterial culture as claimed in claim 13, wherein said cells are comprised of E. coli strain DHSaP, deposited at 
the CNCM on November 29, 1988, under n' 1-821 . 

22. Use of polypeptides according to claims 1 to 3, as a reagent to obtain antibodies for diagnostic purposes. 

50 

Patentanspruche 

1 . Polypeptid, das eine Aminosauresequenz umfafBt, das hap-ProteIn genannt wird und das aus der folgenden Ami- 
nosauresequenz: 

55 
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Met Phe Asp Cys Met Asp Val Leu Ser Val Ser Pro Gly Gin 
lie Leu Asp Phe Tyr Thr Ala Ser Pro Ser Ser Cys Met Leu 
Gin Glu Lys Ala Leu Lys Ala Cys Phe Ser Gly Leu Thr Gin 
Thr Glu Trp Gin His Arg His Thr Ala Gin Ser He Glu Thr 
Gin Ser Thr Ser Ser Glu Glu Leu Val Pro Ser Pro Pro Ser 
Pro Leu Pro Pro Pro Arg Val Tyr Lys Pro Cys Phe Val Cys 
Gin Asp Lys Ser Ser Gly Tyr His Tyr Gly Val Ser Ala Cys 
Glu Gly cys Lys Gly Phe Phe Arg Arg Ser He Gin Lys Asn 
Met He Tyr Thr Cys His Arg Asp Lys Asn Cys Val He Asn 
Lys Val Thr Arg Asn Arg Cys Gin Tyr Cys Arg Leu Gin Lys 
cys Phe Glu Val Gly Met Ser Lys Glu Ser Val Arg Asn Asp 
Arg Asn Lys Lys Lys Lys Glu Thr Ser Lys Gin Glu Cys Thr 
Glu Ser Tyr Glu Met Thr Ala Glu Leu Asp Asp Leu Thr Glu 
Lys He Arg Lys Ala His Gin Glu Thr Phe Pro Ser Leu Cys 
Gin Leu Gly Lys Tyr Thr Thr Asn Ser Ser Ala Asp His Arg 
Val Arg Leu Asp Leu Gly Leu Trp Asp Lys Phe Ser Glu Leu 
Ala Thr Lys' Cys He He Lys He Val Glu Phe Ala Lys Arg 
Leu Pro Gly Phe Thr Gly Leu Thr He Ala Asp Gin He Thr 
Leu Leu Lys Ala Ala Cys Leu Asp He Leu He Leu Arg He 
Cys Thr Arg Tyr Thr Pro Glu Gin Asp Thr Met Thr Phe Ser 
Asp Gly Leu Thr Leu Asn Arg Thr Gin Met His Asn Ala Gly 
Phe Gly Pro Leu Thr Asp Leu Val Phe Thr Phe Ala Asn Gin 
Leu Leu Pro Leu Glu Met Asp Asp Thr Glu Thr Gly Leu Leu 
Ser Ala He Cys Leu He Cys Gly Asp Arg Gin Asp Leu Glu 
Glu Pro Thr Lys val Asp Lys Leu Gin Glu Pro Leu Leu Glu 
Ala Leu Lys He Tyr He Arg Lys Arg Arg Pro Ser Lys Pro 
His Met Phe Pro Lys He Leu Met Lys He Thr Asp Leu Arg 
Ser He Ser Ala Lys Gly Ala Glu Arg Val He Thr Leu Lys 
Met Glu He Pro Gly Ser Met Pro Pro Leu He Gin Glu Met 
Met Glu Asn Ser Glu Gly His Glu Pro Leu Thr Pro Ser Ser 

Ser Gly Asn Thr Ala Glu His Ser Pro Ser He Ser Pro Ser 
Ser Val Glu Asn Ser Gly Val Ser Gin Ser Pro Leu Val Gin 

Oder serotypischen Varianten dieser Sequenz besteht mit der MafBgabe, 6a(i dieses Polypeptid die Retinoesaure- 
und DNA-Blndungseigenschaften von RAR-beta zeigt. 

Polypeptid nach Anspruch 1 , das frei von aus menschlichem Blut abgeteiteten bzw. abgeleitetem Protelnen, Virus, 
viralem Protein, menschlichem Gewebe und menschlichen Gewebebestandteilen ist. 

Polypeptid, das aus der Gruppe ausgewahit ist, die aus den Polypeptiden (a) bis (g): 
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(a) 



Gin 


His Arg His 


Thr 


Ala Gin Ser 


He 


Glu 


Thr 


Gin 


Ser 


Thr 


Ser 


Ser Glu Glu 


Leu 


Val Pro Ser 


Pro 


Pro 


Ser 


Pro 


Leu 


Pro 


Pro 


Pro Arg Val 


Tyr 


Lys Pro Cys 


Phe 


Val 


Cys 


Gin 


Asp 


Lys 


Ser 


Ser Gly Tyr 


His 


Tyr Gly Val 


Ser 


Ala 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly Phe Phe 


Arg 


Arg Ser lie 


Gin 


Lys 


Asn 


Met 


He 


Tyr 


Thr 


Cys His Arg 


Asp 


Lys Asn Cys 


Val 


He 


Asn 


Lys 


Val 


Thr 


Arg 


Asn Arg Cys 


Gin 


Tyr Cvs Aro 


Leu 


Gin 




vy i» 


rne 


rill 


Val 


Gly Met Ser 


Lys 


Glu Ser Val 


Arq 


Asn 


Asp 


Arg 


Asn 




Lys 


Lys Lys Glu 


Thr 




VJX u 


Cys 


xnr 


Glu 


Ser 


Tyr 


Glu 


Met Thr Ala 


Glu 


Leu Asp Asp 


Leu 


Thr 


Glu 


Lys 


He 


Arg 


Lys Ala His Gin 


Glu 


Thr Phe Pro 


Ser 


Leu 


Cys 








(b) 




















Val 


Arg Asn Asp 


Arg 


Asn Lys Lys 


Lys 


Lys 


Glu 


Thr 


Ser 


Lys 


Gin Glu Cys ( Peptid 


1 ); 















(c) 

Asn Asp Arg Asn Lys Lys Lys Lys Glu Thr Cys (Peptid 
2) ; 

(d) 

Cys Gly Val' Ser Gin Ser Pro Leu Val Gin ( Peotid 3 ) ; 



(e) 

Ala Glu Leu Asp Asp Leu Thr Glu Lys He Arg 
(f) 

Met Phe Asp Cys Met Asp Val Leu Ser Val Ser Pro Gly Gin 

He Lou Asp Phe Tyr Thr Ala Ser Pro Ser Ser Cys Met Leu 

Gin Glu Lys Ala Leu Lys Ala Cys Phe Ser Gly Leu Thr Gin 

Thr Glu Trp Gin His Arg His Thr Ala Gin Ser 

(g) 

His Glu Pro Leu Thr Pro Ser Ser Ser Gly Asn Thr Ala Glu 

His Ser Pro Ser He Ser Pro Ser Ser Val Glu Asn Ser Gly 

Val Ser Gin Ser Pro Leu Val Gin 

Oder einer jeden serotypischen Variante eines dieser Polypeptide (a) bis (g) besteht, wobei das Polypeptid Oder 
die serotypische Variante eine Bindungsfahigkeit zu Retinoesaure besitzen. 

Geklonte DNA-Sequenz, die eine Sequenz umfaf3t, die fur ein Polypeptid nach den Anspruchen 1 und 2 codiert, 
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wobei die geklonte Sequenz die folgende Formel (a1): 



ATG 


TTT 


GAC 


TGT 


ATG 


GAT 


GTT 


CTG 




GIG 


AG 1 


L.Gi 


GGG 


V-AA 


ATC 


CTG 


GAT 


TTC 


TAP 


APT 


GCG 


AGT 




TCT 




iGC 


AiAJ 


G L U 




GAG 


AAA 


GCT 


PTP 


AAA 


GCA 


TGC 


iTC 


AGT 


GGA 


TTG 


ACC 


CAA 


Apr* 


GAA 


TGG 


PAG 


PAT 




CAC 


ACT 


GCT 


CAA 


TCA 


ATT 


.GAA 


ACA 


CAG 


AGC 


ACC 


AGC 


tpt 


GAG 

VJA\J 


GAA 


CTC 


Vj 1 v.. 


UUA 


AGC 


CCC 


CtA 


T/*T 
XCl 


CCA 


CTT 


CCT 


CCC 


CCT 


CGA 


GTG TAC 


& A A 
AAA 


ppp 


IGC 




Gi V- 


1 uG 


CAG 


GAC 


AAA 


TCA 


TCA 


GGG 


TAC 


CAC 


TAT 
1 Ai 




GTC 


AGC 


GCC 


iGi 




CGA 


TGT 


AAG 


GGP 


ill 


TTC 


CGC 


AQA 


AGT 


ATT 


CAG 


AAG 


AAT 


ATG 


ATT 


TAG 


ACT 


TGT 


PAP 


CGA 


GAT 


AAG 


AAC 


TGT 


GTT 


ATT 


AAT 


AAA 


GTC 


ACC 


AGG 


AAT 


^wA 


TGC 


CAA 


TAC 


TGT 


CGA 


CTC 


CAG 


AAG 


TCP 


TTT 
ill 




GTG 




AlG 


TCC 


AAA 


GAA 


TCT 


GTC 


AGG 


AAT 


GAC 


AGG 


AAC 


AAG 


Ik n n 

AAA 


AAG 


AAG 


GAG 


ACT 


TCG 


AAG 


CAA 


GAA 


TGC 


ACA 


GAG 


AGC 


TAT 


CAA 


ATG 


ACA 


GCT 


GAG 


TTG 


GAC 


GAT 


CTC 


ACA 


GAG 


AAG 


ATC 


CGA 


AAA 


GCT 


CAC 


CAG 


GAA 


ACT 


rrc 


CCT 


TCA 


CTC 


TGC 


CAG 


CTG 


GGT 


AAA 


TAC 


ACC 


ACG 


AAT 


TCC 


AGT 


GCT 


GAC 


CAT 


CGA 


GTC 


CGA 


CTG 


GAC 


CTG 


GGC 


CTC 


TGG 


GAC 


AAA 


TTC 


AGT 


GAA 


CTG 


GCC 


ACC 


AAG 


TGC 


ATT 


ATT 


AAG 


ATC 


GTG 


GAG 


TVT 


GCT 


AAA 


CGT 



CTG 


CCT 


GGT 


TTC 


ACT 


GGC 


TTG 


ACC 


ATC 


GCA 


GAC 


CAA 


ATT 


ACC 


CTG 


CTG 


AAG 


GCC 


GCC 


TGC 


CTG 


GAC 


ATC 


CTG 


ATT 


CTT 


AGA 


ATT 


TGC 


ACC 


AGG 


TAT 


ACC 


CCA 


GAA 


CAA 


GAC 


ACC 


ATG 


ACT 


TTC 


TCA 


GAC 


GGC 


CTT 


ACC 


CTA 


AAT 


CGA 


ACT 


CAG 


ATG 


CAC 


AAT 


GCT 


GGA 


TTT 


GGT 


CCT 


CTG 


ACT 


GAC 


CTT 


GTG 


TTC 


ACC 


TTT 


GCC 


AAC 


CAG 


CTC 


CTG 


CCT 


TTG 


GAA 


ATG 


GAT 


GAC 


ACA 


GAA 


ACA 


GGC 


CTT 


CTC 


AGT 


GCC 


ATC 


TGC 


TTA 


ATC 


TGT 


GGA 


GAC 


CGC 


CAG 


GAC 


CTT 


GAG 


GAA 


CCG 


ACA 


AAA 


GTA 


GAT 


AAG 


CTA 


CAA 


GAA 


CCA 


TTG 


CTG 


GAA 


GCA 


CTA 


AAA 


ATT 


TAT 


ATC 


AGA 


AAA 


AGA 


CGA 


CCC 


AGC 


AAG 


CCT 


CAC 


ATG 


TTT 


CCA 


AAG 


ATC 


TTA 


ATG 


AAA 


ATC 


ACA 


GAT 


CTC 


CGT 


AGC 


ATC 


AGT 


GCT 


AAA 


GGT 


GCA 


GAG 


CGT 


GTA 


ATT 


ACC 


TTG 


AAA 


ATG 


GAA 


ATT 


CCT 


GGA 


TCA 


ATG 


CCA 


CCT 


CTC 


ATT 


CAA 


GAA 


ATG 


ATG 


GAG 


AAT 


TCT 


GAA 


GGA 


CAT 


GAA 


CCC 


TTG 


ACC 


CCA 


AGT 


TCA 


AGT 


GGG 


AAC 


ACA 


GCA 


GAG 


CAC 


AGT 


CCT 


AGC 


ATC 


TCA 


CCC 


AGC 


TCA 


GTG 


GAA 


AAC 


AGT 


GGG 


GTC 


AGT 


CAG 


TCA 


CCA 


CTC 


GTG 


CAA 



TAA, 

Oder Degenerationsvarianten dieser Formel, die das Polypeptid codieren, umfa3t. 
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5. DNA-Sequenz nach Anspruch 4, die frei von menschlichen Serumproteinbestandteilen, wie menschlichem Gewe- 
be Oder Serumproteine, ist. 

6. DNA-Sequenz mit einer jeden der folgenden Formein (b 1 ) bis (g1 ), welche jeweils den Anninosauresequenzen (b) 
5 bis (g) von Anspruch 3: 

(bl) 

GTC AGG AAT GAC AGG AAC AAG AAA AAG AAG GAG ACT TCG AAG 

10 

CAA GAA TGC; 
(cl) 

,5 AAT GAC AGG AAC AAG AAA AAG AAG GAG ACT; 

(dl) 

GGG GTC ACT CAG TCA CCA CTC GTG CAA; 

20 

(el) 





GCT 


GAG 


TTG 


GAC 


CAT 


CTC 


ACA 


GAG 


AAG 


ATC 


CGA; 








25 


(fl) 






























ATG 


TTT 


GAC 


TGT 


ATG 


GAT 


GTT 


CTG 


TCA 


GTG 


AGT 


CCT 


GGG 


CAA 




ATC 


CTC 


GAT 


TTC 


TAC 


ACT 


GCG 


AGT 


CCG 


TCT 


TCC 


TGC 


ATG 


CTC 




CAG 


GAG 


AAA 


GCT 


CTC 


AAA 


GCA 


TGC 


TTC 


AGT 


GGA 


TTG 


ACC 


CAA 


30 


ACC 


GAA 


TGG 


CAG 


CAT 


CGG 


CAC 


ACT 


GCT 


CAA 


TCA; 










(gi) 




























35 


CAT 


GAA 


CCC 


TTG 


ACC 


CCA 


AGT 


TCA 


AGT 


GGG 


AAC 


ACA 


GCA 


GAG 




CAC 


ACT 


CCT 


AGC 


ATC 


TCA 


CCC 


AGC 


TCA 


GTG 


GAA 


AAC 


AGT 


GGG 




GTC 


ACT 


CAG 


TCA 


CCA 


CTC 


GTG 


CAA; 















Oder Degenerationsvarianten dieser Formein entsprechen. 

40 

7. DN A-Sonde, die im wesentlichen aus einena an die DNA-Sequenz nach einem der Anspruche 4 oder 5 gebundenen 
Radionukleotid besteht. 

8. Hybrid-Duplexnaolekul, das inn wesentlichen aus der DNA-Sequenz der Anspruche 4 Oder 5 besteht, welche Ober 
45 Wasserstoff an eine Nukleotidsequenz mit komplementarer Basensequenz gebunden ist. 

9. Hybrid-Duplexmolekul nach Anspruch 8, worin die Nukleotidsequenz entweder eine DNA-Sequenz oder eine RNA- 
Sequenz ist. 

50 10. Verfahren zum Selektieren einer Nukleotidsequenz, z.B. einer DNA-Sequenz oder einer RNA-Sequenz aus einer 
Gruppe von Nukleotidsequenzen, wobei die Nukleotidsequenz vorzugsweise markiert ist, beispielsweise durch 
ein Radionukleotid, diese Nukleotidsequenz fur SchilddrOsen- oder Steroidhormonrezeptor oder fur RAR-p oder 
einen Abschnitt davon codierl, wobei das Verfahren den Schritt der Bestimmung, welche Nukleotidsequenzen mit 
einer in den Anspruchen 4 bis 6 beanspruchten DNA-Sequenz hybridisiert, umfafBt. 

55 

11. Verfahren nach Anspruch 10, worin die Nucleotidsequenz durch Southern-Blot-Technik unter hochgenauen Be- 
dingungen, die wie folgt durchgefuhrt werden, selektiert wird: 24 h Vorhybridisierung in 50 % Formamid, 5 x Den- 
hardt, 5 x SSC, 300 |ip/ml denaturierte Salmsperma-DNA, bet 40*0; 48 h Hybridisieren mit 35 % Formamid, 5 x 
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Denhardt, 5 x SSC, 10 % Dextransutfat, 2x10^ cpm/ml denaturierte ^Sp-markierte DNA-Sonde (spezifische Ak- 
tivitat 5 X 10Scpm/^g); Waschen in 0,1 x SSC, 0.1 SDS, bei 55'C, 30 Minuten. 

12. Verfahren nach Anspruch 10, worin die Nucleotidsequenz durch North ern-Blot-Technik nach Maniatis, T... Fritch, 
E. und Sambrook, J. (1982), Molecular cloning: a laboratory manual (Cold Spring Harbor, New-York: Cold Spring 
Harbor Laboratory) selektiert wird. 

13. E, coli Bakterienkultur in gereinigter Form, worin die Kultur DNA enthaltende E, coli Zellen umfaf^t, wobei ein 
Abschnitt dieser DNA die in Anspruch 4 beanspruchte DNA-Sequenz umfaOt. 

14. Verfahren zur Untersuchung einer FlOssigkeit auf die Anwesenheit eines Agonisten oder Antagonisten fur Schild- 
drusen- oder Steroidhormonrszeptor, wobei das Verfahren umfa3t: 

(A) das Vorsehen einer waBrigen Losung, welche eine bekannte Konzentration des In Anspruch 1 beanspruch- 
ten proteinartigen Rezeptors enthatt; 

(B) das Inkubieren des Rezeptors mit der FlOssigkeit, von der angenommen wird, da(3 sie den Agonisten oder 
Antagonisten enthalt, unter Bedingungen, welche ausreichen, um den Rezeptor an den Agonisten oder Ant- 
agonisten zu binden, und 

(C) das Feststellen, ob eine Konzentrationsanderung des proteinartigen Rezeptors in der waOrigen Losung 
eintritt. 

15. Verfahren zur Untersuchung einer FlOssigkeit auf die Anwesenheit eines Agonisten oder Antagonisten fur Retl- 
noesaurerezeptor RAR-p, wobei das Verfahren umfaBt: 

(A) das Vorsehen einer waBrigen Losung, welche eine bekannte Konzentration des in Anspruch 1 beanspruch- 
ten proteinartigen Rezeptors enthatt, 

(B) das Inkubieren des Rezeptors mit der FlOssigkeit, von der angenommen wird, da3 sie den Agonisten oder 
Antagonisten enthalt, unter Bedingungen, welche ausreichen, um den Rezeptor an den Agonisten oder Ant- 
agonisten zu binden, und 

(C) das Feststellen, ob eine Konzentrationsanderung des proteinartigen Rezeptors in der waBrlgen Losung 
eintritt. 

16. Verfahren nach Anspruch 15, wobei der Rezeptor und der Agonist oder Antagonist einen Komplex bilden. 

17. Verfahren nach Anspruch 15, wobei ein Vernetzungsmittel in einer Menge vorliegt, welche ausreichend ist, um 
eine Dissoziation des Rezeptors und des Agonisten Oder Antagonisten zu verhindern. 

18. Rekombinantes DNA-Molekul, das eine DNA-Sequenz umfa3t, die fur Retinoesaurerezeptor codiert, wobei die 
DNA-Sequenz auf Expression in einem einzelligen Wirt fur ein Polypeptid codiert. das die Retinoesaure- und DNA- 
Bindungseigenschaften von RAR-p zeigt und wirksam mit einer Expressionskontrollsequenz in dem DNA-Molekul 
verbunden ist. 

19. Rekombinantes DNA-Molekul nach Anspruch 18, wobei die DNA-Sequenz eine jede der Anspruche 4 bis 6 ist. 

20. Plasmid pPROHAP, am 29. November 1988, bei CNCM unter der Nummer 1-821 hinterlegt. 

21 . Bakterienkultur nach Anspruch 1 3, wobei die Zellen aus dem E, coli-Stamm DH5aF' zusammengesetzt sind, der 
bei CNCM am 29. November 1988 unter der Nummer 1-821 hinterlegt wurde. 

22. Verwendung der Polypeptide nach den Anspruchen 1 bis 3 als Reagenz zum Erhalten von Antikorpern fur Dia- 
gnosezwecke. 



Revendicatlons 

1 . Polypeptide comprenant une sequence d'acldes amines, d6nomme proteine hap et constitu6 de la sequence d'aci- 
des amines suivante: 
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Met 






cya 


Mat Aap 


Val 


Lau 


Sar 


Val 


Sar 


Pro 


Cly 


Gin 


lis 




Aap 


Pha 


Tyr Thr 


Ala 


Sar 


Pro 


Sar 


Sar 


Cvs 


Hat 


Lau 


Gin 


GIU 


Lya 


Aia 


Lau Lya 


Ala 


cya 


Pha 


sar 


civ 


tiSU 


Thr 


Gin 


TUP 


GIU 


rrp 




Him. kra 


Hia 


Ttir 


Ala Gin 


Sar 


Ila 


Glu 


Thr 


Gin 


SAT 


Tbr 


Sar 




Clu 




val 


Pro 


Sat 


Pro 


Pro 


Sar 




T A«t 








Val 


TW 

*y* 


Lya 


Pro 


Cvm 


Pha 


Val 


Cvs 






i*y« 








His 


Tyr Gly 


Val 


Sar 


Ala 


Cvs 


n 


Giy 


cy» 




Gly Pha 


Pha 




Arg 


sar 


Ila 


Gin 


Lys 


Asn 


n>% 




Tyr 




vjm mm 


Arg 


ASp 


Lya 


Asn 


Cvm 
wy» 


Val 


Ila 


Asn 






Tnr 


Arq 


Aan Arg 


wya 


GXn 


Tyr 


Cys 




Lau 


Gin 


LVS 




Iram 


GlU 


va* 


oiy nac 




**ys 


Clu 


Sar 


Val 




Asn 


ASB 






T vm 

iy» 


*<ya 


Lya Lya 


Glu 




Sar 


Ly* 


Gin 


Glu 


Cvs 


Thr 


Clu 


»■* 


Tyr 


w AU 




Ala 


Glu 


Lau 


Asp 




Lau 


Thr 


Glu 


L.wa 

i*y» 


**9 




ty» 


&1* Hi A 


Gin 


Glu 


Thr 


Pha 


Pro 


Sar 


Lau 


cvs 


Gin 




civ 


T vm 


4 J * * *»* 


Thr 


Asn 


sar 


sar 


Ala 


ASO 


His 


Aro 


Val 






ASp 


Lau Gly 


Lau 


Ttd 


Asp 


Lys 


Pha 


Sar 


Glu 


Lau 


xia 








Ila Xla 


LVB 


Ila 


val 


Glu 


Pha 


Ala 


Lys 


Arg 




Pro 


civ 


Pha 


Thr Gly 


Lau 


Thr 


Ila 


Ala 


Asp 


Gin 


Ila 


Thr 


L*u 






Xla 


Ala Cya 


Lau 


Aao 


Ila 


Lau 


Ila 


Lau 


Ar^ 


Il« 


wy« 


Ttir 




Tyr 


Thr Pro 


Glu 


Gin 


Asp Thr 


Mat 


Thr 


Pha 


Sar 




(•Ay 


Lau 


Thr 








Gin 


Nat 


Hia 


Asn 


Ala 


Clv 




eiy 




Lau 


TOr Asp 


Lau 


Val 


Pha 


Thr 


Pha 


Ala 


Asn 


cm 


L8U 


Lau 


Pro 


Lau 


Glu Mat 


Aap 


Asp 


Thr 


Glu 


Thr 


Gly 


L«u 


L«u 


5«r 


Xla 


Ila 


cya 


Uu Ila 


cya 


Gly 


Asp Arg 


cm 


Asp 


Lau 


Glu 


Glu 


Pro 


Thr 


Ly» 


Val Aap 


Lya 


Lau 


Gin 


Glu 


Pro 


Lau 


Lou 


Glu 


Ala 


Lftu 


Ly» 


Ila 


Tyr Ila 


Ar9 


Lys 


Arg Arg 


Pro 


sar 


Lys 


Pro 


His 


Hat 


Pha 


Pro 


Lya Ila 


Lau 


Hat 


Lya 


Ila 


Thr 


Asp 


Lau 


Arg 


5«r 


Ila 


Sar 


Ala 


Lys Gly 


Ala 


Glu 


Arg val 


Ila 


Thr 


Lau 


Lys 


Mat 


61u 


Ila 


Pro 


Gly Sar 


Mat 


Pro 


Pro 


Lau 


Ila 


Gin 


Glu 


Mat 


M«t 


Clu 


kmn 


Sar 


Glu Gly 


His 


Glu 


Pro 


Lau 


Thr 


Pro 


Sar 


Sar 



Ser Gly Asn Thr Ala Glu His Ser Pro Ser He Ser Pro Ser 
Ser Val GLu Asn Ser Gly Val Ser Gin Ser Pro Leu Val Gin 

ou des variants s^rotypiques de cette sequence, 

^ condition que ce polypeptide presents les propri6t6s de liaison de Tactde r6tinoique et de I'ADN de RAR-bdta. 

Polypeptide selon la revendication 1 , caract6ris6 en ce qu'il est depourvu de proteines, de virus, de prot6ine virale, 
de tissus hunnains et de composants de tissus humains Issus du sang humain. 

Polypeptide choisi dans le groupe constitud des polypeptides (a) ^ (g) : 
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(&) 


















Gin 


Kift Arg His Thr Ala Gin 


Ser 


11 


Glu 


Thr 


Gin 


S r 


Thr 


Sftr 


Sar Glu Glu Lau Val Pro 


Ser 


Pro 


Pro 


Ser 


Pro 


Leu 


Pro 


Pro 


Pro Xrg Val Tyr Lys Pro 


Cys 


Phe 


Val 


cys 


Gin 


Asp 


Lys 


Ser 


S«r Gly Tyr His Tyr Gly 


Val 


Ser 


Ala 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly Phtt Phe Arg Arg Ser 


lie 


Gin 


Lys 

* 


Asn 


Met 


He 


Tyr 


Thr 


Cys His Arg Asp Lys Asn 


cys 


Val 


ii« 


Asn 


Lys 


Val 


Thr 


krq 


Asn Arg Cys Gin Tyr cys 


kxq 


Leu 


Gin 


Lys 


Cys 


Phe 


Glu 


vai 


Gly n6C ser Lys Glu Ssr 


Val 


Arg 


Asn 


Asp 


Arg 


Asn 


Lys 


Lyt 


Lys Lys Glu Thr Ser Lys 


Gin 


Glu 


cys 


Thr 


Glu 


Ser 


Tyr 


Glu 


Net Thr Ala Glu Leu Asp 


Asp 


Leu 


Thr 


Glu 


Lys 


He 


Arg 


Lys 


Ala His Gin Glu Thr Phe 


Pro 


Ser 


Leu 


Cys 








(b) 


















Val 


Arg Asn Asp Arg Asn Lys 


Lys 


Lys 


Lys 


Glu 


Thr 


Ser 


Lys 


Gin 


Glu Cys (peptide 1) ; 

















(c) 

Xsn Asp Arg Asn Lys Lys Lys Lys Glu Thr Cys (peptide 

2); 



Cys Gly Val Ser Gin Ser Pro Leu Val Gin (peptide 3) ; 
(•) 

Ala Glu Leu Asp Asp Leu Thr Glu Lys He Arg 



(f) 



Met 
He 
Gin 
Thr 


Phe 
Leu 
Glu 
Glu 


Asp 
Asp 
Lys 
Trp 


cys 
Phe 
Ala 
Gin 


Met 
Tyr 
Leu 
His 


Asp 
Thr 
Lys 
Arg 


Val 
Ala 
Ala 
His 


Leu 
Ser 
Cys 
Thr 


Ser 
Pro 
Phe 
Ala 


val 
Ser 
Ser 
Gin 


Ser 
ser 
Gly 
Ser 


Pro 
Cys 
Leu 


Gly 
Met 
Thr 


Gin 
Leu 
Gin 


(g) 

His 
His 
Val 


Glu 
Ser 
Ser 


Pro 
Pro 
Gin 


Leu 
Ser 
Ser 


Thr 
He 
Pro 


Pro 
Ser 
Leu 


Ser 
Pro 
Val 


Ser 
Ser 
Gin 


Ser Gly 
Ser Val 


Asn 
Glu 


Thr 
Asn 


Ala 
Ser 


Glu 
Gly 



ou tout variant ser^otypique de Tun desdits polypeptides (a) ^ (g), le polypeptide et le variant s6r6otypique ayant 
une capacity de liaison vis-^-vis de I'acide retinoTque. 

S6quence d'ADN clon6e, comprenant une sequence codant pour un polypeptide selon les revendications 1 et 2, 
ladite sequence clon^e comprenant la fornnule suivante (a1): 
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TTT 


QAu 


TGT 


ATG 


GAT 


CTT 








TTC 


TAw 


ACT 


GCG 






AAA 


Gr*r 


wTw 


AAA 


GCA 










mT 
wAX 


CGG 


CAC 






AWW 


AUW 


TCT 


GAG 


GAA 






CCT 


WWW 


wwX 


CGA GTG 






AAA 


TCX 

X WA 


X WA 


GGG 


TAC 






TCT 


AAv 


vipw 


TTT 


TTC 




ATT 


XAW 


AWX 


lux 


CAC 


CGA 


All 




AWW 


A\»V9 


XXT 
AAi 


CGA TGC 




TTT 


uAA 




uUA 


ATG 


TCC 


AGO 


AAC 


AAG 


AAA 


AAG 


AAG 


GAG 


GAG 


AGC 


TAT 


GAA 


ATG 


ACA 


GCT 


AAG 


ATC 


CGA 


AAA 


GCT 


CAC 


CAG 


CAG 


CTG 


GGT 


AAA 


TAG 


ACC 


ACG 


GTC 


CGA 


CTG 


GAC 


CTG 


GGC 


CTC 


GCC 


ACC 


AAG 


TGC 


ATT 


ATT 


AAG 



CTG 


Tr*A 
X WA 


uXw 


ACT 
AwX 


CCT 
WW X 


www 




AwX 


wwu 


TCT 


TCC 
X WW 


TCC 
XwW 


ATC 
AXw 




TCP 
X 


TTC 


ACT 
AuX 


CCA 

WUA 


TTG 


ACC 
AUw 


SaAA 


AWX 


CPT 

WWX 


CAA 


TCA 

X WA 


ATT 

AX X 


CAA 


AUIA 


wx w 


V X w 


CCA 

WWA 


ACC 

AVW 


CCC 
www 


CCA 
WWA 


TCT 


TXC 

X AW 


AAA 


CCC 
www 


TCC 

X VJW 


TTC 
X X w 


CTC 
wx w 




WAW 


TAT 

X AX 




CTC 

WAW 


ACC 

AwW 


CCC 
www 


TftT 


r*Gff 

WuW 


AGA 

AUA 


ACT 

AVX 


ATT 

AX X 


CAC 


AAC 

AAW 


AAX 


CAT 

WAX 


AAC 


AAC 

lAW 


TGT 


CTT 

WA X 


ATT 

AX X 


AAX 


^^AA 


TAC 

X AW 


TCT 
X wx 


CCA 

WwA 


CTC 
wx w 


CAC 

WAW 




AAA 


CAA 

VAA 


TCT 


CTC 
uX W 


ACC 

AWW 


AAT 

AAX 




ACT 


TCG 


AAG 


CAA 


GAA 


TGC 


ACA 


GAG 


TTG 


GAC 


GAT 


CTC 


ACA 


CAG 


GAA 


ACT 


TTC 


CCT 


TCA 


CTC 


TGC 


AAT 


TCC 


AGT 


GCT 


GAC 


CAT 


CGA 


TGG 


GAC 


AAA 


TTC 


AGT 


GAA 


CTG 


ATC 


GTG 


GAG 


TTT 


GCT 


AAA 


CGT 



CTG 


CCT 


GGT 


TTC 


ACT 


GGC 


TTG 


ACC 


ATC 


GCA 


GAC 


CAA 


ATT 


ACC 


CTG- 


CTG 


AAG 


GCC 


GCC 


TGC 


CTG 


GAC 


ATC 


CTG 


ATT 


CTT 


AGA 


ATT 


TGC 


ACC 


AGG 


TAT 


ACC 


CCA 


GAA 


CAA 


GAC 


ACC 


ATG 


ACT 


TTC 


TCA 


GAC 


GGC 


CTT 


ACC 


CTA 


AAT 


CGA 


ACT 


CAG 


ATG 


CAC 


AAT 


GCT 


GGA 


TTT 


GGT 


CCT 


CTG 


ACT 


GAC 


CTT 


GTG 


TTC 


ACC 


TTT 


GCC 


AAC 


CAG 


CTC 


CTG 


CCT 


TTG 


GAA 


ATG 


GAT 


GAC 


ACA 


GAA 


ACA 


GGC 


CTT 


CTC 


AGT 


GCC 


ATC 


TGC 


TTA 


ATC 


TGT 


GGA 


GAC 


CGC 


CAG 


GAC 


CTT 


GAG 


GAA 


CCG 


ACA 


AAA 


GTA 


GAT 


AAG 


CTA 


CAA 


GAA 


CCA 


TTG 


CTG 


GAA 


GCA 


CTA 


AAA 


ATT 


TAT 


ATC 


AGA 


AAA 


AGA 


CGA 


CCC 


AGC 


AAG 


CCT 


CAC 


ATG 


TTT 


CCA 


AAG 


ATC 


TTA 


ATG 


AAA 


ATC 


ACA 


GAT 


CTC 


CGT 


AGC 


ATC 


AGT 


GCT 


AAA 


GGT 


GCA 


GAG 


CGT 


GTA 


ATT 


ACC 


TTG 


AAA 


ATG 


GAA 


ATT 


CCT 


GGA 


TCA 


ATG 


CCA 


CCT 


CTC 


ATT 


CAA 


GAA 


•ATG 


ATG 


GAG 


AAT 


TCT 


GAA 


GGA 


CAT 


GAA 


CCC 


TTG 


ACC 


CCA 


AGT 


TCA 


AGT 


GGG 


AAC 


ACA 


GCA 


GAG 


CAC 


AGT 


CCT 


AGC 


ATC 


TCA 


CCC 


AGC 


TCA 


GTG 


GAA 


AAC 


AGT 


GGG 


GTC 


AGT 


CAG 


TCA 


CCA 


CTC 


GTG 


CAA 



TAA, 

ou des variants de d^g^ndrescence de cette fornnule codant pour ledit polypeptide. 

Sequence d'ADN selon la revendication 4, caractdrisde en ce qu'elle est d^pourvue de composants prot^iqi 
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du serum humain, tels que des proteines du tissu ou du serum humain. 

6. Sequence d'ADN ayant I'une quelconque des fomnules (b1) ^ (g1) suivantes, correspondant respectivement aux 
sequences d'acides amines (b) ^ (g) selon la revendication 3: 

(bl) 

CTC AGO AAT GAC ACG AAC AAG AAA AAG AAG GAG ACT TCG AAG 
CAA GAA TGC; 

(cl) 

AAT GAC AGG AAC AAG AAA AAG AAG GAG ACT; 



(dl) 

GGG GTC ACT CAG TCA CCA CTC GTG CAA; 



(el) 

GCT GAG TTG GAC CAT CTC ACA GAG AAG ATC CGA: 
(fl) 

ATG TTT GAC TGT ATG GAT CTT CTG TCA GTG AGT CCT GGG CAA 

ATC CTC GAT TTC TAC ACT GCG AGT CCG TCT TCC TGC ATG CTC 

CAG GAG AAA GCT CTC AAA GCA TGC TTC AGT GGA TTG ACC CAA 

ACC GAA TCG CAG CAT CGG CAC ACT GCT CAA TCA; 

(gi) 

CAT GAA CCC TTG ACC CCA AGT TCA AGT GGG AAC ACA GCA GAG 
CAC ACT CCT AGC ATC TCA CCC AGC TCA GTG GAA AAC AGT GGG 
GTC ACT CAG TCA CCA CTC GTC CAA; 

ou des variants de d6g6n6rescence de ces formules. 

7. Sonde d'ADN constitu6e essentiellement d'un radionucl6otide li^ ^ la s6quence d'ADN de Tune quelconque des 
revendications 4 ou 5. 

8. Molecule duplex hybride constituee essentiellement d'une sequence d'ADN selon la revendication 4 ou 5, liee par 
liaison hydrogdne k une sequence nucldotidique de la sequence de bases compldmentaire. 

9. Molecule duplex hybride selon la revendication 8, caract^risda en ce que ladite sequence nucleotidique est sott 
une sequence d'ADN sort une sequence d'ARN. 

10. Proc^d^ de selection, dans un groupe de sequences nucldotidiques, d'une sequence nucleotidique. par ex. une 
sequence d'ADN ou une sequence d'ARN, ladite sequence nucl6otidlque 6tant de preference nnarquee, par ex. 
par un radionucteotide, ladite sequence nucleotidique codant pour un r6cepteur de I'hormone thyroidienne ou 
steroide, ou pour le RAR-p ou une partie de celui-ci, ledit proc6de comprenant les etapes consistant k determiner 
laquelle desdites sequences nucieotidiques s'hybride ^ une sequence d'ADN selon les revendications 4^6. 

11. Precede selon la revendication 10, caracterise en ce que ladite sequence nucleotidique est choisie par la technique 
de Southern-blot, dans des conditions de forte stringence realisees de la manidre suivante: 24h de pr6hybridation 
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dans du formamide a 50%. 5x Denhardt, 5x SSC, 300 lag/ml d'ADN de sperme de saumon denature, a 40'C; 
hybridation de 48 h avec du formamide k 35%, 5x Denhardt, 5x SSC, sulfate de Dextrane ^ 10%. 2.10^ cpnrVml 
de sonde d'ADN d6natur6e marquee au ^^p (activit6 sp6cifique S.IO^cpm/iig); lavage dansO,1x SSC, 0,1% SDS, 
55^C pendant 30 minutes. 

5 

1 2. Proced6 seion ta revendication 1 0, caract6rise en ce que ladite sequence nucleotidique est choisie par la technique 
de Northern-blot selon Maniatis, T, Fritch, E., et Sambrook, J. (1982), Molecular cloning: a laboratory manual 
(Cold Spring Harbor, New- York: Cold Spring Harbor Laboratory). 

^0 13. Culture bacterienne de E. Coli sous forme purifi^e, caract6risee en ce que la culture comprend des cellules de^ 
Colt contenant de I'ADN, et en ce qu'une partle dudit ADN comprend la sequence d'ADN selon la revendication 4. 

14. M6thode de dosage d'un liquide pour la presence d'un agoniste ou d'un antagonlste du r6cepteur de I'hormone 
tyrotdienne ou steroide, caracterisee en ce que la methode comprend: 

15 

(A) Tobtentlon d'une solution aqueuse contenant une quantite connue du r6cepteur prot6ique selon la reven- 
dication 1 ; 

(B) I'incubation du recepteur avec le liquide soupgonne de contenir I'agoniste ou I'antagoniste, dans des con- 
ditions suffisantes pour Her le recepteur ^ I'agoniste ou k I'antagoniste; et 

20 (C) la determination d'un changement 6ventuel dans ia concentration du recepteur prot^ique dans la solution 

aqueuse. 

15. Methode de dosage d'un liquide pour la presence d'un agoniste ou d'un antagoniste du recepteur de I'acide reti- 
noique RAR-p, caract6rls6e en ce que la methode comprend: 

25 

(A) I'obtention d'une solution aqueuse contenant une quantity connue du recepteur proteique selon la reven- 
dication 1; 

(B) I'incubation du recepteur avec le liquide soup?onn6 de contenir I'agoniste ou I'antagoniste, dans des con- 
ditions suffisantes pour lier le recepteur k I'agoniste ou k I'antagoniste; et 

30 (C) la determination d'un changement dventuel dans la concentration du recepteur prot6ique dans la solution 

aqueuse. 

16. Methode selon la revendication 15, caract6ris6e en ce que le r6cepteur et I'agoniste ou I'antagoniste torment un 
complexe. 

35 

17. Methode selon la revendication 15, caracterisee en ce qu'un agent de reticulation est present dans une quantite 
sutflsante pour inhiber la dissociation du recepteur et de I'agoniste ou de I'antagoniste. 

18. Molecule d'ADN recombinant comprenant une sequence d'ADN codant pour un recepteur de I'acide retinoique, 
40 ladite sequence d'ADN codant, par expression dans un hdte unicellulaire, pour un polypeptide presentant les 

proprietes de liaison de I'acide retinoique et de I'ADN de RAR-p et liee de maniere fonctionnelle k une sequence 
de controle d'expression dans ladite molecule d'ADN. 

19. Molecule d'ADN recombinant selon la revendication 18. caracterisee en ce que ladite sequence d'ADN est I'une 
45 quelconque des revendicatlons 4 a 6. 

20. Plasmide pPROHAP, d6pos6 aupres de la CNCM le 29 novembre 1988 sous le rf 1-821. 

21. Culture bacterienne selon la revendication 13, caracterisee en ce que lesdites cellules sont constituees de la 
so souche de E. Coli DHSaF'. depos6e aupres de la CNCM le 29 novembre 1 988 sous le n* 1-821 . 

22. Utilisation des polypeptides selon les revendicatlons 1 k 3, comme reactif pour obtenir des anticorps k des fins de 
diagnostic. 

55 
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